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Preface 


This textbook is based on my lectures to students at the Mos- 
cow Regional Pedagogical Institute (1945 to 1948) and Moscow 
University from 1945 on. In writing this book we set ourselves 
the difficult task of treating in a single volume the fundamentals 
of atomic theory, that is, Schrbdinger^s nonrelatlvistlc theory, 
Dirac’s relativistic theory, the theory of multielectron atonis, 
and the basic applications of quantum mechanics to solid state 
physics. Our aim was to combine the exposition of general the- 
oretical principles with examples of the application of quantum 
mechanics to specific problems connected with atomic structure. 
To avoid overloading this book, we have abridged the treatment of 
certain specialized topics, but in such cases we have endeavored 
to supply references to standard works on the subject. 

In most textbooks the solution of specific problems with the 
help of Schrodinger’s equation is handled in fairly elegant form. 
The basic mathematical tools required for this purpose are a 
knowledge of second-order differential equations and various spe- 
cial functions (including the Hermite, Legendre and Laguerre 
polynomials). However, applications of Dirac’s theory to specific 
problems (such as the hydrogen atom) are on the whole handled 
less satisfactorily. In some cases the calculations are so long 
and cumbersome that it is difficult to perceive the physical mean- 
ing of the solutions. In others there is no actual derivation of the 
results or only a rough proof is given. In an attempt to avoid 
these pitfalls, we have used an approximate form of Dirac’s equa- 
tion for our treatment of the hydrogen atom (Chapter 19). This 
approximation still enables us to obtain the formula for the fine 
structure of the energy levels and the selection rules (Chapter 18 
and 20). Our analysis of the Lamb shift due to the electron- 
positron vacuum is also somewhat simplified (Chapter 22). 

Several good problem books in quantum mechanics are avail- 
able, and therefore we shall consider only a few problems chosen 
with the aim of elucidating and supplementing the general discus- 
sion. 

The first part of this book was written jointly by me and Yu. M. 
Loskutov, and the second part jointly by me and I. M. Ternov. 
Great assistance was rendered by M. M. Kolesnikova in condens- 
ing notes based on my lectures on quantum mechanics and in 
preparing the manuscript for the press. Chapter 25 was carefully 
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read by N. N. Kolesnikov, who made a number of valuable com- 
ments. I would like to mention the great pains taken by S. I. Larin 
In editing the whole manuscript. 


A. A. Sokolov 



Introduction 


Quantum mechanics dates only from the 1920*s. This important 
branch of theoretical physics deals with the fundamental problem 
of the behavior of microparticles (for instance, the behavior of 
electrons in an atom). As a theo.y, quantum mechanics represents 
an extension of classical mechanics, electrodjmamics (including 
the theory of the electron and the theory of relativity) , the kinetic 
theory of matter, and other branches of theoretical physics. 

Historically, the development of every branch of theoretical 
physics involves two main stages. First comes the accumulation of 
experimental facts, the discovery of semiempirical laws, and the 
development of preliminary hypotheses and theories. This is fol- 
lowed by the discovery of general laws, which provide a basis for 
interpreting a large number of phenomena. For example, the first 
or pre-Newtonian stage of mechanics consisted of the discovery of 
a number of seemingly unrelated laws: the law of inertia, the law 
of free fall under the action of a gravitational field and Kepler’s 
laws of planetary motion. Most of these laws were discovered only 
after years of painstaking work by many scientists. Thus, many 
astronomical observations preceded the discovery of Kepler’s 
laws. We may recall the great efforts of Copernicus, Bruno, Gal- 
ileo, and others to establish that the Sun is the center of our 
planetary system and that the Earth is only a planet like Mars, 
Venus, or Jupiter. It was only after working for fifteen years on 
Tycho Brahe’s extremely valuable observational data that Kepler 
found the semiempirical laws describing planetary motion. After 
these preliminary, seemingly independent laws had been estab- 
lished, Newton was able to show that they all rested on the same 
theoretical foundation. Newton’s three laws of motion and the law 
of universal gravitation opened a new stage in the development of 
theoretical mechanics. One of the great triumphs of Newtonian 
mechanics was Leverrier’s prediction of the existence of a new 
planet, Neptune, from perturbations in the motion of Jupiter. 

In a similar fashion, Maxwell’s formulation of the laws of elec- 
trodynamics was preceded by the discovery of empirical laws de- 
scribing various electric and magnetic phenomena. Coulomb’s 
law of interaction between electric charges and magnetic poles^ 
and the Biot-Savart law of interaction between an electric current 
and a magnetic pole were found by analogy with Newton’s law of 


^As magnetic monopoles do not exist in nature, Coulomb’s law in magnetostatics is 
verified by means of magnetic dipoles. 



xvi 


INTRODUCTION 


gravitation. All of these phenomena were explained on the basis of 
the principle of ‘‘action at a distance,’’ according to which one 
charge acts directly on another through the intervening space. 

After Newton, and independently of investigations of electric 
and magnetic phenomena, considerable attention was devoted to 
optics. At a relatively early stage, it was established that light 
consists of transverse waves propagating with a finite velocity 
of cm/ sec. The nature of these waves, however, re- 

mained imknown. 

All of these preliminary studies belonged to the first stage of 
development of electrodynamics: they prepared the ground for 
Maxwell’s theory, which had approximately the same unifying role 
in electrodynamics as Newton’s laws in mechanics. Maxwell’s 
equations provided a powerful tool for the investigation of electric, 
magnetic and optical phenomena. Maxwell’s theory predicted the 
existence of electromagnetic fields, which carry the interaction 
continuously from point to point, and of electromagnetic waves, 
which were later discovered by Hertz. The theory of propagation 
of electromagnetic waves underlies all of modern radio engineering. 

Another important result of Maxwell’s theory was aproof of the 
wave nature of light. 

The view that matter and electricity have an atomic structure 
was of considerable Importance in connection with the appearance 
of quantum mechanics. This view had very ancient roots, but 
remained without scientific foundation until the discovery of the 
fundamental law of chemistry— the law of exact proportions. The 
kinetic theory of matter and, in particular, the kinetic theory of 
gases— based on the classical Maxwell- Boltzmann statistics— were 
important steps in the development of atomic theory. It is worth 
noting that the classical Maxwell- Boltzmann statistics, which rests 
on probability theory, cannot be completely explained in terms of 
Newtonian mechanics and contains certain features that are char- 
acteristic only of large collections of particles (for example, the 
irreversibility of certain processes). Statistical methods made it 
possible to e^qplain a number of macroscopic properties of matter, 
such as temperature and specific heat; this provided indirect 
evidence of the atomic structure of matter. 

One of the decisive proofs of the atomic theory of matter was 
the discovery of fluctuations, that is, statistical fluctuations in the 
behavior of individual molecules. Brownian motion was particularly 
important in this connection, as it provided the evidence of molec- 
ular movement in a liquid. Even more suggestive proofs of the 
atomic structure of matter were provided by Laue’s observation of 
the diffraction of x-rays in crystals and Aston’s mass-spectro- 
graphic measurements of the atomic weights of individual isotopes 
of various elements. 

From an analysis of Faraday’s laws of electrolysis, Helmholtz 
showed that there must be a fimdamental quantity of electricity. 
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equal to 4.8- 10 esu, such that any charge, positive or negative, 
is an integral multiple of this charge. Studies of anode rays indi- 
cated that positive charges always appear as ions; that is, a positive 
charge is always associated with the basic mass of an atom. The 
lightest positive ion is that of a hydrogen atom. It is known as a 
proton and its mass is nearly the same as the mass of a neutral 
hydrogen atom. The carrier of a negative charge can take the form 
of a negative ion or of a much lighter particle known as an electron. 
From measurements of the deflection of cathode rays (a beam of 
electrons) in electric and magnetic fields, it was found that the 
mass of the electron was about 1/1836 of the mass of the proton.^ 

These discoveries led to Lorentz^s electron theory, which 
represents an interesting synthesis of MaxwelFs electrodynamics 
for a vacuum and the atomic view that matter consists of positive 
and negative charges. In Lorentz^s theory the magnetic permea- 
bility, dielectric constant, and conductivity of a medium were 
obtained by averaging Maxwell’s equations for a vacuum over 
charges and currents of particles of the medium. A conductor was 
treated as a medium filled with free electrons or, in other words, 
an ‘‘electron gas.” It followed from Lorentz’s theory that the 
dielectric constant depends on the frequency of electromagnetic 
waves, whereas in Maxwell’s theory it had been assumed that this 
quantity is a constant. Lorentz’s theory provided an explanation of 
the dispersion of light. The appearance of this theory was accom- 
panied by the extension of electrodynamics to frames of reference 
traveling with constant relative velocities; this culminated in the 
special theory of relativity. It is well known that all laws of motion— 
whether they be Newton’ slaws or Maxwell’s equations for the motion 
of an electromagnetic field— must be associated with a frame of 
reference. Newton believed that his laws were related to an abso- 
lute frame of reference. Even in his writings, however, this notion 
remained purely metaphysical, and Newton himself discovered the 
principle of relativity in mechanics according to which it is im- 
possible to detect a uniform rectilinear motion of a body (or a 
frame of reference) relative to this absolute system, because all 
frames of reference moving linearly with constant relative velocities 
with respect to each other are completely equivalent. 

Consider the Galilean- Newtonian transformation from one in- 
ertial system to another, moving along the x axis with relative 
velocity v 


x' = x — vt, y'=y, z' = z, 


2 

In 1932, a particle with a positive charge and mass equal to that of an electron, known 
as the positron, was discovered Positrons are formed in small quantities when cosmic 
rays pass through matter. Under ordinary conditions a positron cannot exist for any sig- 
nificant length of time because it combines with an electron and the two particles are 
converted into gamma-ray photons (see Chapters 3 and 22). 
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where the primed coordinates refer to the moving system, and the 
xmprimed coordinates to the stationary system. We find that 
accelerations and forces are identical in the two frames of refer- 
ence, and therefore the equations of mechanics (in which the 
velocity does not appear) are invariant under this transformation. 
If the Galilean- Newtonian transformation is applied to the Maxwell- 
Lorentz equations, they assume different forms in different inertial 
systems, because the equations contain the velocity of propagation 
of electromagnetic waves which, added vectorially, has different 
values in different inertial systems. The original Mlchelson- Mori ey 
and other numerous experiments showed, however, that the speed of 
light is the same in any direction in all inertial coordinate systems. 
As a result, Einstein generalized the Newtonian principle of rela- 
tivity in a way that led directly to the so-called Lorentz trans- 
formations 


x'= 


X — vt 


tf—y, z'= 2 , 


f 




V i-P* ’ 


P = 


— 

C • 


The classical laws of electrocfynamics are Invariant under this 
transformation. Since the equations of Newtonian mechanics, 
however, are not invariant under the Lorentz transformations, 
they had to be replaced by relativistic equations in which the 
mass m of a moving particle was related to its velocity v and 
its rest mass moby the relationship 


At low velocities, where the relativistic equations reduce 

to the Newtonian formulations. 

The Maxwell- Lorentz equations for an electromagnetic field 
and the relativistic equations of motion of an electron constituted 
the culminating stage of the classical electron theory. According 
to this theory, light consists of electromagnetic waves and an 
electron is a particle whose motion is described by relativistic 
mechanics. The success of the Maxwell- Lorentz theory in account- 
ing for certain microscopic phenomena (the propagation and dis- 
persion of light, the motion of an electron in electric and magnetic 
fields, and so forth) was accompanied by the discovery of exper- 
imental facts that could not be explained with classical concepts. 
These experiments will be described in Chapters 1—3, and there- 
fore we shall mention them here only veiy briefly. 

In the first place, it was found that black-body radiation, the 
photoelectric effect, and the Compton effect could be ejqplained only 
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on the basis of corpuscular properties of light. This was the im- 
plicit assumption of the Planck-Einstein photon theory, in which 
the discrete structure of light was described in terms of Planck^s 
constant h = 6.62-' 10“^’ erg. sec. The photon theory was also suc- 
cessfully used by Bohr in constructing the first quantum theory of 
the atom, based on the planetary model suggested by Rutherford. 
In the second place, a number of experimental facts, including the 
electron diffraction, indicated that in addition to its corpuscular 
properties, an electron has wave properties. De Broglie's def- 
inition of the wavelength of an electron also included Planck’s con- 
stant h. This led eventually to the development of a new science- 
electron optics— which provides a theoretical basis for electron 
microscopy. 

The Schrddinger wave equation (1926) was the first general 
theoretical treatment that explained both of these classes of phe- 
nomena and unified the preliminary theories of Planck, Einstein, 
Bohr, and de Broglie. This equation made it possible to discover 
the laws of behavior of electrons and other elementary particles 
and to construct a relatively systematic theory of radiation that 
took into account the quantum nature of light. For atomic physi- 
cists, the Schrddinger equation was one of the most powerful tools. 
Many phenomena associated with the behavior of an electron in an 
atom and with the absorption and emission of light by an atom were 
provided with a theoretical explanation (see Chapters 4-14). The 
later development of quantum theory showed that the Schrbdinger 
equation did not describe all the properties of atoms. In particu- 
lar, it could not explain correctly the interaction of an atom with a 
magnetic field (for instance, the anomalous Zeeman effect) and it 
could not be used to construct a theory of multielectron atoms. 
One of the main reasons for this was that the Schr’ddinger theory 
did not take into account the electron spin. 

Dirac’s relativistic theory (see Chapters 15-17) was an exten- 
sion of the Schrodinger theory that considered relativistic and spin 
effects of moving electrons (see Chapters 18-20). It turned out that 
the quantitative corrections due to relativistic effects were rela- 
tively small, but that spin effects were of fundamental importance 
in connection with the fine structure of multielectron atoms (see 
Chapters 23 and 24). These effects explained the filling of electron 
shells in an atom and gave a theoretical basis to Mendeleyev’s 
periodic table of elements (see Chapters 25 and 26). 

Although the fundamental problems related to the structure of 
the atom were basically solved with the appearance of Dirac’s 
equation, we are constantly adding further details to our knowl- 
edge. At present a great deal of attention is being devoted to the 
influer ^te of the electron- positron vacuum and magnetic moments 
on the energy levels of atoms (see Chapters 21 and 22). Quantum 
mechanics has also been applied to simple molecules (Chapter 27), 
solid state physics (Chapter 6), and the atomic nucleus. 
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Chapter 1 


The Quantum Theory of Light 


Before the supremacy of classical physics was challenged by 
the advent of quantum mechanics in the beginning of this century, 
particle motion was sharply distinguished from wave motion. 
According to the classical picture, the world consisted of particles 
(for example, electrons and ions) and fields (for example, light). 
This picture was completed by MaxwelPs theory (1873), which 
appeared to have definitely established that fields had wave-like 
properties. 

Towards the end of the nineteenth century and in the first years 
of this century, this state of affairs was disturbed by the discovery 
of experimental facts that did not fit into the classical conceptual 
framework. On the one hand, there were certain phenomena, such 
as the radiation spectrum of an ideal black body, the photoelectric 
effect, and the Compton effect, which could be understood only in 
terms of particle-like properties of light. On the other hand, elec- 
trons were observed to have wave-like properties, such as diffrac- 
tion, which later served as a basis for the development of electron 
optics. 


A. PRINCIPLES OF THE ELECTRON THEORY 

The behavior of the electromagnetic field produced by a given 
distribution of charge and current is described by the well-known 
Maxwell-Lorentz equations 

^ u I dE Anov 

+ = ( 1 . 1 ) 

V •£=4Ttp, 

V-H=0, 


where £ and Hare the electric and magnetic field intensities, re- 
spectively, p is the charge density (for example, of the electron), 
and o is its velocity. 
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To start with, an electromagnetic field transmits interactions 
between the charges. The interaction between stationary charges 
e is transmitted by an electrostatic field which satisfies Coulomb^s 

law (^V whereas the interaction between moving charges 

is transmitted by a system of electric and magnetic fields, since 
a moving charge can be regarded as an electric current and it is 
well known that a current interacts with a magnetic field. Elec- 
tromagnetic fields are always associated with sources of the 
appropriate type (for example, charges). 

Secondly, electromagnetic fields may be regarded as electro- 
magnetic waves, which propagate with the velocity of light c (radio 
or light waves). As a particular example of a source of light waves, 
we can take an accelerated charge. The latte remits radiant energy; 
per unit time, this energy is 


where e is the charge, and w its acceleration. Once electromagnetic 
waves have been produced, they can exist independently of their 
sources. 

The equation describing the propagation of a light wave is 
obtained from Eqs. (1.1) by setting the charge density p equal to 
zero. We can then eliminate the vector H from the second Maxwell 
equation by taking the curl of this equation and substituting into it 
the first equation. Since V£'.~0, we can use the vector relation 

VxVx£= V(V-£)— V*£= — V*£, 

to obtain the following wave equation, which holds for the compo- 
nents of both vectors £ and H: 

where / is any component of the vectors £ or//. 

A more detailed analysis of the Maxwell- Lorentz equations 
shows that electromagnetic waves are transverse. This means that 
the electric field intensity (£) and magnetic field Intensity (//) are 
mutually orthogonal, and also orthogonal to the wave vector k, 
which points in the direction of propagation of the electromagnetic 
wave. The vectors form a triad such that when a right-handed 
screw is turned from £ to H, it moves along the direction of k : 

Hz=/i'‘xE, (1.4) 


where ft" = kjk is a unit vector. 
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A charge (say, an electron) moving In externally applied electric 
and magnetic fields ejqjeriences a force 

F=e[E+^VxH), (1.5) 

which is called the Lorentz force. 

Taking into account the relativistic variation of mass, the 
equation of motion of an electron in an external field has the form 

( 1 - 6 ) 

where 


We can select a Lagrangian fimction X in such a manner that 
the variational principle 


Xi, t)dt = Q, (1.7) 

or, in more explicit form, 


= 0 2, 3). (1.8) 

will yield the equation of motion of an electron (1.6). Here;^, =Ar, 
x,i=y^ X'i = z are the spatial coordinates, and the jti denote the 
corresponding velocities. To obtain (1.6), we must set 

^ / 1 ~ P* — -|- jVA, (1.9) 

where A and o are the vector and scalar potentials of the electro- 
magnetic field. These potentials are related by the Lorentz condi- 
tion 

^•'* + 7 ^= 0 . ( 1 . 10 ) 


The electric and magnetic field intensities can be expressed in 
terms of A and3> by means of the relations 


£== — V® — 1^, 
c dt ' 

H=S/xA. 


( 1 . 11 ) 


We find the following expression for the electron momentum: 
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Similarly, for the generalized force acting on the electron, we 
obtain 




(1.13) 


Substituting (1.12) and (1.13) into (1.8) and taking into account (I.IO) 
and (1.11), we obtain Eq. (1.6) for the motion of an electron. Thus, 
our choice of the Lagrangian is justified. 

Since we know we may also determine the Hamiltonian //: 

It is well known that the Hamiltonian should not be expressed in 
terms of the velocity cp, but in terms of the generalized momentum 

P=p — ^A, which, according to (1.12), is related to cp by the 

equation 


c’P’ + mJc* 


c’mjo* 

l-P* 


my= 


mjc* 
1-P’ • 


Therefore, the relativistic form of the Hamiltonian is 

H= 


(1.14) 


We note that if the potentials are time- independent^ the Hamiltonian 
is equal to the total energy ( E = // )^ 

In the nonrelativistic approximation (P^m^c), Eq. (1.14) can be 
written in the form 


H = H' 


where //' is approximately equal to the nonrelativistic part of the 
Hamiltonian 


From this it can be seen that the relativistic equation for the 
Hamiltonian (1.14) also includes the rest-mass energy m,c* of the 
electron. It is very Important to take this rest-mass energy into 
account in stuc(ying transformations of elementary particles. 
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B. THE CLASSICAL THEORY OF BLACK-BODY RADIATION 

Among all the phenomena associatedwith electromagnetic fields, 
special importance can be attached to the properties of cavity radia- 
tion. This can be described as thej radiation inside a cavity, com- 
pletely surrounded by o p aq u e wa fl a^ which are heated to a certain 
cpAsfaSritSQp^^ alternatively, as radiation uLeaullfeT 

rium with an i s othe r mal enclosurCg^ A small hole made in the wall 
of~such encl osure be haves lik e an ide^blacR body, because 
practicaily ^jo ^terhal rays incident on the openi n g are r eflected. 
In otherword sr essentially all rays entering the cavity Throu gh the 
E ble ~afe~absorbed. or. more precisely, the probability that th^v 
reemerga ^ from the _hole_is negligib ly small. Consequently*. the 
aavit v__ radiation escapln g ^rom the hole mav be regarded a.s the 
radiation which would b e obtaingd from an idea l bl^k body, a n d it 
is generally referred tolis^^Black-body radiation. 

The analysis of black-body radiation played a particularly im- 
portant role in the foundation of quantum theory. Although a more 
or less reasonable classical explanation could be found for many 
other phenomena, every single theory of black-body radiation con- 
structed on the basis of classical concepts failed to agree with the 
experimental facts. A systematic theory of black-body radiation 
was developed only in the beginning of this century, when Planck 
introduced the concept of a quantum of energy. This concept 
was later to play an important role in the development of the first 
quantum theory of the atom, and afterwards in the development of 
quantum mechanics. 

We shall now consider the theory of black-body radiation, con- 
fining ourselves for the time being to classical concepts. Let the 
radiation be characterized by its spectral density Po,,' which is 
related to the ordinary electromagnetic energy density 

« = (1.16) 

by the relation 

P- = S-. (1.17) 


In the literature, the function pi^ is sometimes used for the spectral density. The 
function p^) is related to pi, by the equation 


1 

Pco - pi/ > 

2tt 


since O) = 27^1^. 
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where du is the energy density of the radiation in the frequency 
range from w to (o + do). Obviously, 


oo 

« = |p<„da). (1*18) 

On the basis of the second law of thermodynamics, Kirchhoff 
showed that the density p„ is determined only by the temperature 
of the walls of the closed cavity and is entirely independent of the 
material of which the walls are made; that is, p^=f{w, T), 

Consequently, the walls of the cavity may be considered as a set of oscillators. The 
average energy of these osciUators is completely determined by the spectral density of 
the black-body radiation. We shall show this starting with the equation of motion for an 
oscillator and taking into account Planck’s radiation damping 

Here e and nto are the charge and rest mass of the oscillator, a> is its natural frequency 
of oscillations, and Ex Is the x component of the electric field intensity of the black- 
body radiation. 

Representing Ex in the form of a Fourier series 

00 

n*=— 00 


where Exn is the amplitude of an individual oscillation of the field with frequency 

= ( 1 . 21 ) 


we obtain the following equation forx(t) from Eq, (1.19): 

_L p Jntoot 
mo 


-2 


oo <0 — (n»«) +( -g — 


( 1 . 22 ) 


The average energy of the oscillator, which, according to the virial theorem, is twice 
the average kinetic energy, is given by 


E = moJt^ = mo 


„=_oo <o»— («“o)- + I J 


(1.23) 


where the bar denotes averaging over time. Since 

n — — 00 n, n'™— oo 
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where /_;j =/«» since, moreover. 


£ 

(n— n') ^ JL J («—/*') -- | 


I for n' = n, 
P for n' n, 


(U24) 


where x = 27c/a)o, Eq, (1.23) can be reduced to the form 


moJf* = 2mo 2 






This equation has a very sharp maximum in the neighborhood of frequency o> , and 
hence the total energy of the oscillator will actually depend only on those terms of the 
series for which n<oo ^ Consequently, in the above equation, the square modulus of the 

amplitude | Exn 1^ can be replaced by| |, where /to — — , and at the same time the sum 
can be changed to an integral. According to (1.21),2 


=; <aodn = (Oq = — j 
Uq 


since = 1. Therefore, we obtain 


E = 2 


riQe^ 


nto 


■'xnp I i 


(0* diOn 


J r 2 

(T {a)J — to®}* +<4 

' " ^ “\3 moc®/ 


(U5) 


(1.26) 


Replacing the frequency <•>„ by <o everywhere except in the difference — introduc- 
ing the variable of Integrauon S = — w, and extending the limits of integration to 

±; GO, we find 


E = 2 


npg® 

mo 




s 

— oo 



2 <0® 


(1.27) 


On the other hand, the energy density u, which is related to the electromagnetic field 
of the radiation by Eq, (1.16), can also be expressed in terms of | |», Since the radia- 

tion is isotropic we have, on the basis of Eq, (L16), 

« = i (PTTn = 1 (£i + £» + £.), (U8) 


^The reader should not feel uneasy about our equating the differential d(0„ to the finite 
quantity cOq = 277/ r, for it is always possible to make the period r sufficiently large so that 
the relationship ojq o) will be satisfied. Mathematically, this corresponds to a transit 
tion from Fourier series to Fourier integral. 
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Using the expansion (1*20) and the rule (1.24) for averaging over t , we obtain 


00 00 



n=— oo 0 


Hence, taking into account Eq. (1.18) together with the relationship 


dn = — ^ = /io 
0)0 


0) » 


fortaf^ = 0 ) (/t = /2o), we obtain 


Pa» 


3/Io i ^xnp 

2it <0 * 


(1.29) 


(1.30) 


(Ul) 


Comparing Eqs. (1.31) and (1. 27), we find the relationship between 
the average energy E=mQX^ of the oscillator and the spectral 
density of the radiation 


Pa> 






( 1 . 32 ) 


which forms the basis of the theory of black-body radiation. 

In classical statistical physics, the energy distribution of 
particles is given by the function 


N (E) = Ae- 


( 1 . 33 ) 


where a= l/JfeT; 1.38 • 10"‘“ erg.deg** is Boltzmann’s constant, 
and T is the temperature of the medium. The average energy of the 
particles is 


E 


A J Ee-'^^dE 



4- In J .-■«<(£ = 


= t\na. = kT. 

da 


( 1 . 34 ) 


Substitutii^ this value of E into Eq. (1.32), we obtain the Rayleigh- 
Jeans formula 


?<.==- 


( 1 . 35 ) 
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This equation satisfies the Wien^s thermodynamic law 

P»=<«7(t). (1.36) 

which was based on various results In thermocfynamlcs and the 
electromagnetic theory of light. In the region of long wavelengths 
(low frequencies), the Raylelgh-Jeans formula Is In good agree- 
ment with experimental data. At short wavelengths, however. It 
completely falls to agree with experiment (see Fig. 1.1). 



Fig. 1.1. Radiation spectrum of an ideal black body. 

The heavy dotted line indicates the Rayleigh-Jeans 
curve pcj - PqX^ , and the solid line the Planck curve 
pco PqX^ /( e^ — 1), which is the same as the experi- 
mental curve. Here pQ — {kT)^ , O) = OJqX, and 
coq = kT/H. 

In exactly the same way, the use of the Rayleigh-Jeans formula 
for calculation of the radiation energy density [see Eq. (1.18)] re- 
sults in a divergent integral, that is, we obtain the obviously absurd 
relationship 


u = J p,d«) = f U)> du) = oo. (1,37) 

0 ^ 0 

This was called the ^‘ultraviolet catastrophe^^ by Ehrenfest. Thus, 
the classical theory was completely unable to give a satisfactory 
description of black-body radiation. 

C. PLANCK'S EQUATION 

In 1900, Planck put forward an important hypothesis which re- 
moved the ultraviolet catastrophe and radically changed a number 
of fundamental principles of classical physics. According to this 
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hypothesis, the energy of microscopic systems (atoms, molecules, 
and so forth) does not vary continuously and assumes only certain 
specific discrete values. In particular, the energy of a harmonic 
oscillator must be a multiple of a certain minimum value e: 

E = m. (1.38) 


where rt = 0, 1, 2 .... 

In order to determine the average value of the energy, we must 
replace the integral (1.34) by the sum 


00 



(1.39) 


Substituting this value of E into Eq. (1.32), we obtain the spectral 
density of the radiation 

P*""” I * (1.40) 


We can bring this equation into agreement with Wien’s thermo- 
dynamic law by letting s be proportional to < 0 : 

£ = ^0). (1.41) 

We then obtain P/ancfe's equation 


P«) ^ > (1.42) 

which was a brilliant achievement of quantum theory. 

The quantity fi= 1.05 • 10“^'^ erg. sec, which has the dimensions of 
action, is called Planck's constant.^ 

At low frequencies i j, the exponential may be expanded 

in the form of a power series in fi^;kT . Restricting ouselves to the 
linear terms of the e^qpansion, we have 


In the literature, Planck*s constant is more often taken as the quantity h = 27Tfi - 
6.6249 • 10 erg • sec, which relates the energy € to the frequency v: 


€ = hv . 
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Thus, Planck’s equation (1.42) reduces to the Rayleigh-Jeans 
formula (1.35). 

In the case of high frequencies (^3' we may neglect the 1 in 
the denominator of Eq. (1.42) and write in the form 


Po, 




g—fi(ofkr^ 


(1.43) 


Planck’s equation (1.42), which describes the dependence of the 
spectral density of thermal radiation on the frequency to, is in 
excellent agreement with experiment (see Fig. 1.1), 

From Eqs. (1.42) and (1.18), we can find the total radiation 
density 


oo oo 

r , n c cfo) 


(1.44) 


Introducing the variable ^ = fio)lkT and considering that 

C Vdi __ n' 

J eS— 1 “15’ 

0 

we obtain the well-known Stefan-Boltzmann law'* 


k*T* 


(1.45) 


where 


“ = iS£-= 7.56 • 10-" erg • cm ''’, deg '*. (1.46) 

From Eq. (1.42), it can be seen that the spectral density of 
black-body radiation has a maximum at some value of oi and that 
the position of this maximum changes with temperature. The 
equation governing the position of this majjj^um is called Wien’s 
displacement law. More often, Wien’s displacement law is expressed 


'^Usually it is not the density u which is measured, but the energy aT"* which is radiated 
per second per square centimeter of the black body's surface within a solid angle of 271 

(ii - 277). In this case, the Stefan-Boltzmann constant is G ~ 277 — a ~ — a ~ 5.67 • 10 

8 77 4 

erg • cm ^ • deg ^ • sec * 
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in terms of the spectral distributionwith respect to the wavelengths 
X . To determine we can use the expression for u 


Since X=r2TC/(B, transforming to the spectral distribution over the 
frequencies, we have 


CO CO 

0 0^ 

from which we find 

2nc 

Px X® * (1.48) 

X5 

To determine the wavelength X^ax at which px has its maximum, 
we set '5-^ = 0: 

OA 



2Ttctl 




kPAj, 


2nch 

kl'K 

e max — ] 


= 0 . 


Setting 2^c1llkT\^^ = ij^ we obtain the equation 




whose solution can be given with good accuracy in the form 

i/;^5(l— e-») = 4.965. 

Thus Xj^^ is relatedjlll the temperature T by the equation 


S 2ncfl 

max' 


= 6 = 0.29 cm -deg, 


(1.49) 


which expresses Wien's displacement law, and where b is the 
Wien’s constant. According to this law, as the temperature of an 
ideal black body increases, the maximum of the radiation intensity 
is shifted towards shorter wavelengths (see Fig. 1.2). 
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Equations (1.46) and (1.49) relate Planck’s constant ft and Boltz- 
mann’s constant k to the constants a and b. 

Knowing the numerical values of a and b , we can determine 
ft and k. This is the way in which a numerical value was first 
obtained for ft and a better value found for k. 



Fig. 1,2. Wien's displacement law 
Curves of the spectral energy distribution as 
a function of temperature for an ideal black 
body: A„ax'*’ ^ 0 29 cm . deg 


Recapitulating, it follows from Planck’s hypothesis that proc- 
esses such as emission and absorption Involve discrete quanta. In 
other words, the energy change of particles involved in these proc- 
esses is discontinuous and not smooth as would follow from the 
laws of classical physics. 


D. EINSTEIN’S PHOTON THEORY 

In deriving his equation, Planck assumed that the energy of the 
oscillators is quantized. Theoriglnal version of the theory, however, 
does not provide any physical justification for this property. Indeed, 
Planck himself chose to attribute the “special properties” to the 
heated body rather than to the electromagnetic radiation. 

The second Important step towards the Ag^lopment of quantum 
theory consisted of Einstein’s hypothesis that oscillators absorb 
and emit radiation in discrete amounts because electromagnetic 
radiation Itself consists of discrete particles, called photons, which 
carry an amount of energy ftu). In effect, Einstein Interpreted 
Planck’s equation as a description of the corpuscular properties 
of light. 

We shall now attempt to develop an elementary theory of 
photons. 
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According to classical theory, the energy of a light wave is 

e = -I J (E^ + W) d\x = L J F <Px, (1.50) 

where (Px is an element of volume, and the integration extends over 
all space. The electromagnetic momentum n of the light wave in 
classical theory is 




EH(Px, 


(1.51) 


According to Eq. (1.4), this can also be written in the form 

(1.51a) 


Comparing (1.51a) and (1.50), we find the relation between the 
momentum n and the energy e : 

« = I . (1.52) 


In the theory of relativity, a similar relation between energy 
and momentum holds for particles with zero rest mass and it can 
easily be obtained from Eq. (1.14) by substituting 
and >1 = 0. 

From these considerations, Einstein concluded that an electro- 
magnetic field can be considered as a set of particles called photons, 
with zero rest mass and the energy 

£ = ^0). (1.53) 

For the photon momentum the following equation is obtained: 

n = k'>^ = k0j==ilk, (1.54) 


where /z = , and k = is the wave vector = is the wave 

number). 

Chi the basis of these concepts, in 1905, Einstein constructed a 
quantitative theory of the photoelectric effect, which had been dis- 
covered by Hertz in 1887. What is observed in the photoelectric 
effect is the following: the potential difference required for a 
spark to jump between two small charged spheres is reduced if the 
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cathode is illuminated. To explain this phenomenon, Einstein 
postulated the simple equation 

= (1.55) 

This is essentially the law of conservation of energy and indicates 

that the kinetic energy of the ejected electron is equal to the 

difference between the energy of the absorbed photon fiw and the 
work function W of an electron in the metal. It is obvious that if 
electrons cannot be ejected from the metal. Only if the 
energy of the incident photons exceeds W can electrons leave the 
metal. 

The experimental verification of Einstein^ s theory of the photo- 
electric effect provided strikingconfirmationof his basic conclusion 
that the energy of the ejected electrons depends only on the fre- 
quency of the incident light and not on its intensity, and that the 
emission of photoelectrons begins only when the frequency of 
light 0 ) exceeds a certain limit (so-called threshold frequency) 



The _ implications of the photon theory were brought out and 
verified in 1923 by experiments on the scattering of x-rays by 
free electrons (tte Compton effect). 

Tlie Compton effect was, particularly r 

interesting because it confirmed not '"n. 

pnly the law of conservation of energy 

(which was already verified by the 
photoelectric effect) but also the law 

of conservation of momentum. ' | of Ught by a free 

It IS well known that, in classical 
theory, the frequency of light does not 

change when it is scattered by a free electron (io' = w). By contrast, 
in quantum theory, part of the photon^s energy e = ^u) is transferred 
to the electron (see Fig. 1.3). Consequently, the energy and 
frequency of a scattered photon should generally be somewhat 
smaller (e'<e, . To find the dependence of frequency on the 

scattering angle, let us write the laws of conservation of energy 
and momentum, treating the photons as particles: 


Au) — ftu)' = (m — mo), 
Ilk — tik' — mv. 


(1.56) 


Here and m==mo/Kl — represent the mass of the electron 
before and after collision; v is its velocity; p==i;/c; Hk==tli)iilc and 
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hk'^fiio'lc represent the momentum of the photon before and after 
scattering. We rewrite Eq. (1.56) in the form 

c* 

0) — u)' = — mo), 

" (1.56a) 

Taking the square of these equations and subtracting the first 
equation from the second, we obtain 

0 ) 0 j' (1 — cos 0) == (cu) — CU)'). (1.57) 


Substituting X = 2 t:c/(o, X' = 21 :^ /(o', and dividing (1.57) by coto', we find 
an expression for the increase in wavelength of the scattered light 

AX = X' — X = 2X,sin*y, (1.58) 

where Xo is the Compton wavelength of the electron 


Xo- 


2nh 

WqC 


== * =2.4- lO-'" cm, 

nhc 


We therefore see that, according to quantum concepts, the wave- 
length of the scattered light X' must be greater than the initial wave- 
length X(V^X) since (o'<;(o. This difference increases with the 
scattering angle 0. Since the Compton wavelength Xo is relatively 
small, Compton scattering is observed at relatively short wave- 
lengths ( x-rays and gamma rays ). Indeed, for visible light 
(X^ 10 ® cm) 

T-~T~'0'’=10-’Vo. 


whereas for x-rays (X^ 10 10“* cm) 

~ 10 ' = 10 »/„. 

Therefore, the Compton shift can be observed experimentally only 
in the second case. 

In his experiments, Compton studied the scattering of radiation 
from an x-ray tube by graphite and other substances (lithium, 
beryllium, sodium, potassium, iron, nickel, copper, and so on) at 
different angles 1>. The spectral distribution of the intensity of the 
scattered radiation at different scattering angles was measured by 
means of an ionization chamber. 
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Figure 1.4 shows the spectral distribution of incident and 
scattered waves. If the incident wave (upper curve) has one maxi- 
mum, the scattered wave (lower curve) will have, in addition to 
this maximum, a second maximum at a longer wavelength. The 



Wavelength 



Fip. I 1-. Spectral distribution of x-rays in 
the Compton effect before (upper curve) 
and after (lower curve) scattering 


distance between the wavelengths of the two maxima must corre- 
spond to the Compton shift; this is because the distance increases 
with the scattering angle, and, in addition, it does not depend on 
the type of scattering material [both these facts are in accord 
with Eq. (1.58)^]. The unshifted maximum corresponds to scatter- 
ing by electrons which are strongly bound to the nucleus (or 
more precisely, electrons whose binding energy is greater than 
the energy of the x-ray quanta). The shifted maximum corresponds 
to scattering by electrons which are so weakly bound to the nucleus 
that, in practice, they can be regarded as free. 

Thus the results of Compton’s experiments completely confirm 
the quantum nature of light (that is, the photon theory). 


^Only the intensity of the maxima depends on the type of scattering substance. As the 
atomic weight of the scattering substance increases, the intensity of the unshifted maxi- 
mum increases, and that of the shifted maximum decreases. 


Chapter 2 


The Bohr Quantum Theory 


A. BASIC INFORMATION ON PROPERTIES OF ATOMS 

A theory of the atom was developed only after reliable experi- 
mental data had been obtained from studies of the effects described 
below* 

1) Emission of light by atoms. From careful studies of the 
radiation of atoms, it was established that they have bright-line 
spectra and that the lines are arranged in certain definite series. 
For example, all the lines of hydrogen are described by Balmer^s 
formula^ 

( 2 . 1 ) 

where R is the Rydberg constant, and n'and n are Integers. Setting 
/i' = 1 and n = 2, 3, 4, , . . , we obtain the Lyman series, which lies in 
the ultraviolet part of the spectrum. For n'— 2 and n == 3, 4, 5, 
we have the Balmer series, which is located in the visible part of 
the spectrum and is, therefore, easiest to study. 

Formula (2.1) can also be written in the form of a difference 
between two quantities 

= (2-la) 


I 


In spectroscopy, it is customdry to write rialmer’s formula in the fomi 


1 

A 



where the Rydberg constant for hydrogen is l< 109,677. b cm ^ The value of the Rydberg 
constant in Eq (2 1) is related to H by the equation 

H 2rrcR^p 2rr • 3 29- 10^ '^ sec ^ - 20 66- 10^ '* sec ^ 

Equation (2.1) Js not too convenient for spectroscopial use, since it is usually the wave- 
lengths which are experimentally determined. In developing a theory, we do not need to 
take this reservation into account, since we can always transform from the frequency to to 
the wavelength A 

1 to 

A 277 C 
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which are called spectral terms. For the hydrogen atom, these 
terms are given by 



This possibility of representing the radiation frequencies « as a 
difference between two terms is aconsequenceof the Ritz combina- 
tjon principle, which has important spectroscopic applications in 
regard to the hydrogen atom, as well as more complex atoms. For 
example, hydrogen was initially found to have two series, corre- 
sponding to n'=\ (the Lyman series) and to /i' = 2 (the Balmer 
series). On the basis of the Ritz combination principle,^ a third 

series was predicted with /i' = 3 and /i = 4, 5, 6 This series was 

later discovered by Paschen in the infrared region of the spectrum. 

2) The behavior of an atom in external electric and magnetic 
fields and, in particular, the interaction of the atoms of a substance 
with fast particles passing through it. The most important experi- 
ments in this area were conducted by Rutherford, who succeeded 
in finding the distribution of positive charges inside the atom from 
the analysis of fast-a -particle scattering. 

3) Finally, investigation of various properties of molecules 
provided important data pertaining to the properties of atoms. For 
example, the formation of simple homopolar molecules and the 
valence theory found their explanation only on the basis of the 
modern quantum theory of the atom. 


B. THE CLASSICAL MODEL OF THE ATOM 

Once it had been established that an atom consists of a positively 
charged part associated with most of the mass, and of light, 
negatively charged electrons, attempts were made to construct a 
static model. The reason this approach to the problem was adopted 
is that, in classical electrodynamics, an accelerated electron emits 
radiation, the amount of energy emitted per unit time I)eing 

w = — (2.2) 

dt 3 e> ’ 

2 

The Ritz combination principle was first formulated as folbiws if there are two 
different frequencies belonging to the same senes, the difference between these fre- 
quencies is also a frequency which can also be emitted by the atom, but bidongs to another 
senes. The concept of "terms”, permits a relatively simple explanation of this. Indeed 


Hence 


and thus the Ritz combination principle leads directly to Eq (2 la) 
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where e = — eo is the 
elementary charge), w 



Fig. 2.1. Thomson’s model of 
the hydrogen atom (Z 1) 
The positive charge Zf’Q is 
uniformly distributed over 
the volume of a sphere of 
radius Hq The electron (with 
charge -cq) is located at a 
distance x from the center of 
the atom 


electron charge (^o==4,80 • 10~*®esu is the 
is the acceleration of the electron, and c is 
the velocity of light in vacuum. The minus 
sign in front of dE/dl shows that the energy 
of the electron decreases as a result of 
the emission of radiant energy. Since an 
atom does not radiate in the ground state, 
it follows from the classical theory that 
the charges in the atom should be at rest. 

The most interesting classical model 
was that of Thomson, according to which 
the positive charge uniformly filled the 
entire atomic volume, and the electronic, 
that is, negative point charges were lo- 
cated inside the atom. 

For example, in the hydrogen atom, the 
positive charge was supposed to fill uni- 
formly a sphere of radius Ro (see Fig. 
2.1). The charge density inside the sphere 
was (for Z==l) 


In the ground state, the electron was supposed to be located at the 
center of the sphere, where the electric field is zero. At a distance 
r==.v<^i?o from the center, the electric field E is directed along 
the radius and its intensity can be found from Gauss ^ law: 


Hence 




epr^ 

Rr 



(2.3) 


Therefore, if an electron with charge e = — and mass mo is 
placed at a distance x from the center of the atom, it experiences 
a quasi-elastic attractive force towards the center 


F== — ^oE== — 

with this force, the differential equation describing the motion of 
the electron is 

^-fa)5jf=0. (2,4) 

The solution of this equation is 


j: = /1cos(V + ?o). 
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where 



Substituting for too the fundamental frequency observed in the Balmer 
series, we obtain a very reasonable value for the radius of the 
atom, namely, cm. This value is many times greater 

than the classical radius of the electron 

r,==^,^2.8. 10-‘» cm. 

TTIqC* 

The Thomson model agreed completely with the classical 
Lorentz theory, according to which atoms can be represented as 
harmonic oscillators. Unfortunately, the Thomson model could not 
explain the regularities of the line spectra of atoms and, in particu- 
lar, the spectral series of hydrogen that are described by Eq. (2.1). 
Indeed, from the standpoint of classical theory, the Thomson model 
could emit radiant energy only at the fundamental frequency u)© or, 
at best, at its harmonics 




(2.5) 


where n = 1, 2, 3,... 

The decisive blow to the Thomson model was dealt by the 
experiments of Rutherford, who showed that the positive charge is 
not distributed throughout the entire volume of the atom, but is 
concentrated virtually at one point. Nevertheless, the Thomson 
potential Inside a nucleus of finite dimensions, with a charge 
uniformly distributed through the volume, is 


(j) — 

Ro 


(I 



( 2 . 6 ) 


the charge of the nucleus is Zcq, the electric field intensity inside the nucleus is 

^ __ ^^0 ^ r 

Rq dr r ' 

from which, using the boundary condition that at r = Rq potential is the same as for a 
point charge 


^r=Ro 


ZCq 


«0 ' 


we obtain Eq. (2 6), 
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does play an important role, especially when corrections for the 
volume of the nucleus must be made. Moreover, when mesic atoms 
with large Z are formed (in a mesic atom, an electron is replaced 
by a negative n meson, whose mass is 207 times larger than the 
mass of the electron), there may be states in which the negative n 
meson is always inside the nucleus. In this case, the motion of the 
ju meson will be determined mainly by the potential (2.6), The apn 
propriate equations of motion will, however, be quantum-mechanical, 
rather than classical (see Chapter 21). 


C. RUTHERFORD’S EXPERIMENTS AND CONCEPTS OF 
ATOMIC STRUCTURE 

Our present model of the atom is based on the famous experi- 
ments conducted by Rutherford in 1911 on the passage of alpha 
particles through matter. It is well known that alpha particles, 
which are products of nuclear disintegration, possess a sufficient 
energy to penetrate into an atom. At the time when Rutherford 
conducted his experiments, there was no other source which could 
produce sufficiently heavy particles (that is, particles with a mass 
comparable to the nucleus) with an energy great enough to penetrate 
the atom. 

By passing alpha particles through thin sheets of metal (foil), 
Rutherford showed that most of the alpha particles which pass 
through the foil are scattered through relatively small angles (2-3®) 
from their initial direction of motion. Within the framework of 
the Thomson model, these small deflections could be explained in 
terms of the statistical theory of random processes, because of 
the relatively weak interaction between the atoms and the alpha 
particles. Rutherford and his co-workers, however, also detected 
individual deflections of alpha particles through very large angles 
of up to 180°, Although the number of these deflections was very 
small (for example, when a beam of 8000 primary alpha particles 
from RaC is passed through platinum foil, at most one particle is 
deflected through an angle greater than 90°), it was, nevertheless, 
much larger than the number which could be predicted on the basis 
of superposition of a large number of small random deflections. 

Large scattering angles were also observed when alpha particles 
were passed through a gas. These could be easily seen in photo- 
graphs taken in a Wilson cloud chamber. 

From a general analysis of his e^qjeriments, Rutherford estab- 
lished, first, that atoms are fairly transparent to alpha particles 


4 

In earlier experiments, Rutherford and his co-workers had established that alpha 
particles have the same mass as the helium atom, and a positive charge which is twice 
the magnitude of the electron charge It is now known that alpha particles are the nuclei 
ot helium atoms 
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(that is, their structure is relatively '‘open^^); and, second, that 
large deflections can take place only if a very strong electric field 
exists inside the atom. This electric field must be produced by a 
positive charge which is associated with a large mass and con- 
centrated in a very small volume. We note in this connection that, 
according to Eq. (2.6), the largest field produced by a nucleus of 
radius Rq is 


To explain these results, Rutherford proposed a planetary 
model of the atom in which the structure of the atom resembles 
a planetary system. A positively charged nucleus constituting 
almost the entire mass of the atom is concentrated at the center 
in a very small volume of radius -10 and charged 

electrons move about this nucleus in closed orbits like planets 
around the Sun. We note that the potential energy of the Newtonian 
attraction between a planet of mass m and the Sun (of mass M) 

‘^Newt r * 

where X is the gravitational constant and has the same form as the 
potential energy of the Coulomb attraction between an electron and 
a nucleus 



From this model, Rutherford developed a quantitative theory of 
scattering. His calculations were based on the assumption of a 
Coulomb interaction between the alpha particles and the nucleus. 
The influence of the atomic electrons was neglected in the first 
approximation, since their energy is considerably lower than the 
energy of the bombarding particles. 

Let us find, following Rutherford, the trajectory of an alpha 
particle moving in the field of an infinitely heavy point nucleus 
having a charge Ze^, Our calculations will be carried out in a 


the fmiteness of the nuclear mass A/nut taken into account, the nucleus has a 
certain recoil (like that of the alpha particle) as a result of the interaction In this case, 
all the calculations must be performed in the center-of-mass system and, m the results 
obtained for the case M^uc ' U is necessary to replace the mass of the alpha particle 
Mq by the reduced mass 




^^O^^nuc 
Mq t 


(see Chapter 12, Section C for a discussion of the reduced mass) 



26 


NONRELATIVISTIC QUANTUM MECHANICS 


coordinate system whose origin coincides with the nucleus (see 
Fig. 2.2). Since the field produced by the nucleus is centrally 
symmetric, in determining the trajectory of the alpha particles we 
can use both the law of conservation of energy 

E = const. (2.7) 

and the law of conservation of angular momentum 

L = My(r y ©)= const, (2.8) 

where is the mass of the alpha particle, r is its coordinate, 
and V is its velocity. 



Fig. 2.2. Diagram for the derivation of Ruther- 
ford’s formula for the cross section of elastic 
scattering of alpha particles by nuclei. 


Let us introduce the polar coordinates r and f. The velocity of 
the particle is given by 

= ry, (2.9) 

where vii=/‘ and vi =r^ are the components of velocity parallel 
and perpendicular to the radius vector r, respectively, andr=™ 
and cj) =^1 , We then obtain, instead of Eqs. (2.7) and (2.8), 

£ = ^ -J- 1/ r= + rY) + = const, (2.10) 

L^ = Mq (rxv)^ — MQr'^^ = const. (2.11) 

In the absence of interaction, the alpha particle would pass the 
nucleus at a distance b (this distance b is called the impact para- 
meter). Setting the initial velocity equal to (that is, the velocity 
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^_^„oo and as follows from Fig. 2.2), then (2.10) and 

(2.11) can be reduced to the form 

(A* + rV) 4- = £o, (2.12) 

I I = Motoo, (2.13) 

where the initial energy Eg is related to the initial velocity tz# by 
the equation 



Introducing the new variable 



and noting that then, according to (2.13), 


and 



= v^hu'‘. 



\vM\, 


where «' = ^, we transform Eq. (2.12) to 


«" + «’ + 




(2.14) 

(2.15) 

(2.16) 

(2.17) 

(2.18) 


Differentiating this equation with respect to tp, we obtain 

+ « + = (2.19) 

Hence 

« = /lcos? + 5sin(p (2.20) 

The unknown constant coefficients A and B can be determined 
from the initial conditions 


lim /*= lim — = oo, (2.21) 

y-.It y-»it ^ 


and 


lim/'sin9= lim = 

y It y r 


( 2 . 22 ) 
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Setting cp^TT and u 0 in Eq. (2.20), we obtain 


rzei 


(2.23) 


and, consequently, applying the condition (2.22) to Eq. (2.20), we 
have 

B = (2.24) 

Thus, we finally obtain 


" = ;^(l + coscp). 


(2.25) 


This equation gives the relationship between the absolute value 
of the radius vector r and the polar angle cp, and thus describes the 
trajectory of the alpha particle in the Coulomb field of the nucleus. 
It is an equation for a hyperbolic trajectory in polar coordinates. 
By definition, the scattering angle ^ is equal to the angle 
cp = at which the length of the radius vector r becomes 

infinite (that is, from Fig, 2,2, « = y = 0). Therefore, from (2,25), 

we find 


^ _Mobvl __ bEo 

cot 2 ’22^3 ' /el * 


(2,26) 


It follows that the scattering angle increases as b decreases, 
attaining 180° for = 0 (see Fig. 2.3). 


(b-^) 



Kip. 2 3 Dependence of the scattering 
angle d on the impact parameter h, where 

hi V />2 


Equation (2.26) can be checked experimentally by photographing 
the tracks of alpha particles in a Wilson cloud chamber. From the 
maximum scattering angle, it is possible to compute the minimum 
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value of b , which turns out to be of the order of the nuclear radius. 
The actual form of interaction between an alpha particle and a 
nucleus can, however, be determined more accurately from an 
investigation of the cross section for the scattering of alpha particles 
by nuclei. For this purpose, Rutherford calculated the relative 
number of particles scattered at an angle 0 or, to be precise, the 
number of scattered particles which we would expect to find within 
the solid angle 

d!ii = 2TCsin0di>. (2.27) 


Suppose N particles impinge per unit time on a unit surface 
placed perpendicularly to the original velocity of the particles. 
From Eq. (2.26), it follows that the scattering angle depends only 
on the impact parameter h. For a particle to be scattered through 
an angle it must strike a ring formed by two circles with radii 
b 2 indb — db. The area of this ring is 2Kb db. Therefore, the number 
of particles which hit this area and then, as a result of scattering, 
are found within the solid angle d2 is 

dM==:N-2K\bdbl (2.28) 

From Eq. (2.26), we obtain for the relative number of particles 
scattered through an angle d 

^ = (2.29) 

The ratio dM/N has the dimensions of area and is called the differ- 
ential cross section. It is usually denoted by do. Taking the deriva- 
tive of cot^^ in Eq. (2.29), we obtain the Rutherford formula for 
elastic scattering of alpha particles by a Coulomb center: 


do 




(2.30) 


This formula no longer depends on parameter 6. 

If all the quantities in this equation are kept constant, with the 
exception of 0, we would expect the following equation to hold: 

dosin‘| = (^)'d2 = const. (2.31) 

For 0 — TC, it is found, however, that the quantity do ceases to 

be constant and begins to decrease somewhat. This fact was 
explained by Blackett, one of Rutherford’s students, who studied 
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the limits of applicability of the Coulomb law. Blackett took a large 
number of photographs of tracks of particles in a Wilson cloud 
chamber and calculated the frequencies with which the various 
scattering angles occur. From analysis ofthe experimental data, he 
established that the number of particles observed at large scatter- 
ing angles [according to (2.26), large angles correspond to small 
values of the parameter h ] is markedly smaller than the number 
yielded by the formula (2.30). On this basis, Blackett concluded 
that, in air, for example. Coulomb’s law is valid down to dis- 
tances of the order of 3* 10 cm. At smaller distances, there is a 
deviation from Coulomb’s law. Indeed, at cm, the 

interaction between the alpha particle and the nucleus appears to 
take the form of a strong mutual attraction. Further experimental 
investigations have confirmed the existence of a characteristic 
attractive force at distances less than 10 cm. This attraction 
drops off rapidly with increasing distance from the nucleus. 

The Rutherford formula (2.30) can be used to find the number 
Z from an experimental determination of dN and N, This was 
undertaken by Chadwick, another of Rutherford’s students. Chadwick 
showed that the value of Z is very close to the atomic number, 
which gives the element’s position in Mendeleyev’s periodic table. 
The existence of this phenomenon was rigorously proved at a later 
date. 

Thus the experiments of Rutherford and his colleagues definitely 
established the planetary model of the atom. These experiments 
proved that the positive charge of the electron is concentrated in 
a nucleus with dimensions of 10'*® — 10"*'^ cm, and that, inside the 
atom. Coulomb forces keep the electrons moving in orbits with 
a radius of the order of 10 ® cm. 

D. THE BOHR THEORY 

First of all, let us attempt to develop Rutherford’s planetary 
model of the atom on the basis of classical theory. We shall re- 
strict ourselves to the case of a hydrogen-like atom, namely, an 
atom with a nucleus of charge Ze^, with a single electron (of 
charge e = — eo) moving around it. Particular examples of hydrogen- 
like atoms include hydrogen (Z=l), ionized helium (Z = 2), and 
so forth. 

Introducing the polar coordinates rand cp(x = rcoscp, y = r sin (p), 
we obtain the following equations for the kinetic energy and the 
Coulomb potential energy: 


r=-^(^‘+rV). 


y — 
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Then the Lagrangian Is 

where mo is the mass of the electron.® 

From this Lagrangian, we obtain the following equations for 
the motion of the electron: 


Here 


and 


-M. 

dt d<f 

- dX 
liP' — 5r 


= 0 , 
= 0 , 


Pr=^ — nhf 


(2.33) 


(2.34) 


represent the generalized momenta associated with the cp and r 
coordinates, respectively. Since cp does not occur explicitly in 
X (in other words, it is a cyclic coordinate), it follows that 

= Therefore, the corresponding generalized momentum is a 
constant of the motion 


Py = = const . (2.35) 

This is the law of conservation of angular momentum, which is well 
known from classical mechanics. The second conservation law, 
namely, the law of conservation of energy 

£ = r+l^ = const (2.36) 

is obtained from the condition that the time t does not occur ex- 
plicitly in the Lagrangian. 

We shall consider only the simplest case, that of circular orbits, 
for which /^— -0, Accordingly, p^ = m^f vanishes, and from (2.33) we 
have 


dr 





(2.37) 


®In the following chapters, the mass of the electron in nonrelativistic theory shall be 
denoted by rriQ, since m will be used for the magnetic quantum number. 
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Hence 




Zej 

Wor* * 


Therefore, we obtain for the energy of the electron 


p Jjh __ ^^0 L ‘?£i _L 1/ 

^ — 2 ^ r 2 " r ‘“"2 


(2.38) 


(2.39) 


Let us now express the basic parameters of the atom in terms 
of the adiabatic invariants, which were introduced by Ehrenfest. 
According to Ehrenfest’ s definition, in the case of periodic motion, 
the quantities 


^ Pi dx^ = li (2.40) 

(pi is a generalized momentum, and Xi a generalized coordinate) 
remain constant during slow (adiabatic) changes of the parameters 
of the system (for instance, the charge). 

In our case, there is only one degree of freedom (A:, = (p). The 
conditions (2.35) and (2,40) yield the equation 

Pf=m^r^ = -^, (2.41) 

or 

^ = (2.42) 


From Eqs. (2.38) and (2,42), we can obtain expressions for r and 
in terms of the adiabatic invariant: 


r 


? = too 


P 

4n^moZel * 

_ Sn^moZ^e^ 
“■ /« 


(2.43) 

(2.44) 


According to Eq. (2.39), the energy of the electron is then 

B = — (2.45) 


It follows that the frequency of mechanical oscillation Vo is given 
by the derivative of the energy with respect to the adiabatic in- 
variant: 


_dE __ 4n^moZ*et 
di ~ /• 


(2.46) 
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This relation holds not only for the case under consideration, but 
also for any periodic or quasi-periodic motion.^ Moreover, a 
system performing any periodic motion can generally emit radia- 
tion not only at the first harmonic k=\, but also at the harmonics 
/2 = 2, 3, 4, . . , . The following expression is, therefore, obtained for 
the classical frequency of radiation: 


v = H = (2.46a) 

The classical theory of the planetary model of the atom led to 
a number of difficulties when it was used to explain the radiation 
of atoms. Since this model is dynamic, it follows from classical 
electrodynamics that the centripetal acceleration of the electron 

= where v is its velocity, and a is the radius of its orbit) 
will cause the electron to lose energy at the rate 

dE 2 e^w* 

~ If ~~T c* 

until it falls into the nucleus. This is not, however, what actually 
takes place, and atoms can exist in a non- radiating state for an ar- 
bitrarily long time. Moreover, according to the classical theory, 
the frequency of the radiation should be the same as the frequency 
of mechanical oscillation (the fundamental frequency (o == = 2irvo ) 

or, alternatively, an integral multiple of this frequency (one of the 
harmonics = moo , where = 2, 3,4, . . .). Once again, this predic- 
tion does not explain Balmer^s experimentally established formula 
(2.1) for the lines in the spectrum of radiation. 


^We define periodic motion as a motion in which a point returns to its initial position 
after a certain period of time. As examples of this, we can take harmonic motion 

X - a cos (i)t 


or motion about an ellipse 


X ^ a cos cot and v - b sin 0)t , 

a special case of which is motion about a circle (a - h). Quasi-periodic motion is motion 
in which a point does not return, as a rule, to its initial position, but each individual 
coordinate reassumes its initial value after a certain period of time (which is different 
for each of the coordinates). As an example of quasi-periodic motion, we can take 

X - a cos coit , 
y - b cos c02t , 


where the frequency is incommensurable with 0 ) 2 - 
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The solution to these difficulties was found in 1913 by Niels 
Bohr, who added two postulates to the classical laws of motion. 
First, Bohr assumed that each atom has a series of discrete 
stationary states in which the electron does not emit radiation, even 
though its motion is accelerated (the postulate of stationary states). 
According to Bohr’s theory, these stationary states can be deter- 
mined by quantizing the adiabatic invariants: 

^p^dqi=^nh, (2.47) 

where the quantum number n can assume only Integral values, that 
is, n = 1, 2, 3, , . . (in classical mechanics, the adiabatic invariant/ 
could assume any constant value). 

Second, Bohr hypothesized that, when an electron passes from 
a stationary state with energy (the initial state) to a state with 
an energy En> < (the final state), the atom radiates a quantum of 
energy (the frequency /)os/Mia^e), whose angular frequency 

w is 


(0 



(2.48) 


This equation can be written in a form similar to the classical 
expression (2.46a) for the frequency of radiation: 

= (2.48a) 

Here the quantum number k=n — n' ceui be interpreted as the cor- 
responding harmonic, and, moreover, the derivative of E with 
respect to /, which occurs in the classical ejq)resslon, is replaced 
by the ratio of the finite increments. 

Bohr’s postulate that the stable energy states of atoms form a 
discrete spectrum was confirmed in experiments conducted in 1919 
by Franck and Hertz. Passing an electronbeam (a current) through 
mercury vapor, they showed that, for electron energies less then 

4.9 ev, the collisions of electrons with Hg atoms do not effect 
the magnitude of the current. When the electron energy attains 

4.9 ev (see Fig. 2.4), the current suddenly drops. With further in- 
crease of the electron energy, periodic sharp dips of the current 
are observed (approximately every 4.9 ev). This phenomenon can 
be very simply Interpreted from the point of view of the Bohr 
theory. Let us suppose that the energy of an unexcited Hg atom 
(that is, the atom before collision) is £o and, in conformity with the 
first Bohr postulate, let us take the next possible energy value to be 
£i = £o + 4.9 ev. It then follows that a beam of electrons of energy 
£< 4.9 ev is not sufficient to raise the atoms to an excited state; 
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therefore, the collisions are elastic and the current does not change. 
If however, 4.9 ev, the electrons in the beam may give up a 
part of their energy (namely, 4.9 ev) to the atoms, and therefore, 
the current will change. If the electrons' energy lies in the range 
14.7 ev 9.8 ev, the transfer of energy to the atoms can occur 

twice; 4,9 ev is given up in the first collision and 4.9 ev in the 
second. 



Fig. 2.4. The dependence of current on 
electron energy in the Franck-Hertz 
experiment (1 ev - Cq/SOO -16* 10 


erg). 

We shall now use the first and second Bohr postulates [Eqs. 
(2.47) and (2.48)] to construct a theory of the hydrogen-like atom. 
In the equation for the radius of the orbit (2.43) and the equation for 
the energy (2.45), let us substitute for / its quantized value, which 
from Eq. (2.47) is 

I = 2nnfl. 

We thus have 




'2n^n^ • 


(2.49) 

(2.50) 


For n=\, we obtain the energy of the lowest (ground) state of the 
atom 


mpZ'^ejt 

2n^' * 


(2.51) 


and the corresponding radius 
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where 


flo — 


nioel 


0.529-10"* cm 


(2.53) 


is the radius of the first Bohr orbits 

The second Bohr postulate (2.48), combined with Eq. (2.50) for 
the frequency of radiation wnns gives 




/ J I \ 

2n» U'* 


(2.54) 


Accordingly, for 2 — 1, we obtain Balmer^s formula (2.1). The Bohr 
theory also relates the Rydberg constant, established prior to this 
on a purely empirical basis, with the Planck constant ft: 


o 


(2.55) 


This derivation of Balmer’s formula, with a value for the 
Rydberg constant that agreed with experiment, was one of the 
greatest achievements of the Bohr theory. In spite of this success, 
however, the Bohr theory has a number of inherent defects, which 
became increasingly Important in its further development. 

In the first place, the Bohr theory was obviously semiclassical 
in nature. In addition, the Bohr theory could be used to compute 
only the frequency of the spectral lines, and not their intensity. To 
find the intensity, it was necessary to resort to classical electro- 
dynamics, on the basis of the so-called correspondence principle ^ 
Finally, the Bohr postulates could not be used to construct a satis- 
factory theory of multielectron atoms, including that of helium, 
which possesses only two electrons. 


^Accorflin^ to Iht* rorrospondenre principle, in the limit, all the results of a previous 
theory (here, ( lassical elt'ctrodynannes) should follow from a new theory (here, the Bohr 
theory). 

For instan< e, for h > 0 (or at the limit ot lari'e quantum numbers), the results of quan- 
turn mechanics should appioach the classical results In exactly the same way, when 
fT V 0. the results of the lelativistic theory should approach the nonrelativistic results, 
and so on. 'Flius, the correspondence principle enables us to check the extension of a 
theory by requiring it to reduce to the classical picture at the limit. 

In the initial sta^i^s «t ^ new theory, when it cannot yet be used to investigate certain 
phenomena, the correspondence principle may bo used for reasonable extensions of the 
predictions of the old theory to the new theory Thus, for example, the Bohr theory could 
be used to calculate the frequency of the radiation, but not its intensity Bohr was able 
to determine the intensity from the correspondence principle, according to which the only 
changes ot the quantum number n that were allowed (the selection rules \n - n — n) were 
those that coincided with the classically allowed harmonics of the radiation. The quantum 
theory of radiation of light was completed only with the development of quantum electro- 
dynamic s. 
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Accordingly, we shall not discuss in greater detail the subsequent 
history of the Bohr theory. This theory represented only an inter- 
mediate stage in the development of quantum mechanics, a theory 
which can be used to determine both the frequency and intensity of 
the radiation of atoms. However, we thought that at least a brief 
discussion of the basic principles of the Bohr theory was advisable, 
for this theory still retains considerable heuristic significance. 
In particular, the Bohr theory often serves as the starting point in 
the analysis of many results related to quantization. 

The conclusions that follow from Eq. (2.54) will be discussed in 
Chapter 13, where the problem of the hydrogen-like atom will be 
solved quantum-mechanically. 

\Problem 2.^ Using the Bohr theory, quantize a hydrogen-like atom for the case of 
ellij!!Prc'"TJrWl^(rhat Is, find the spectrum of the energy levels). Show that, in the non- 
relativlstic approximation, the coordinates rand y have tlie same frequency of variation; 
that is, the motion is periodic. Show that the formula for the frequency of the radiation 
of an atom remains the same as in the case of circular orbits* 

What new feature is introduced in the theory by considering elliptic orbits? 

Solution, Using condition (2.40) and noting that, in this problem, the generalized 
momenta corresponding to the ^ and r coordinates are 

, • *l/”„ ,, , 2moZel Pf 

= = const and y 


we find 


2n 

/y= P/<P = 2xp^, 
() 


'max 

/,=2 = 2n I — + ZeJ 

''min 



There the second integral was calculated from the formula 
''max 

''min 

Hence the energy E is 


(2.56) 


E 




< 0 . 


By direct differentiation of this equation with respect to the variables and 7^, it is 
easily shown that the frequencies with which tlie r and coordinates change 

dE dE 

coincide with one another. Using the Bohr quantization rule (2.47), we replace the adiabatic 
invariants and /^by rirh and n^h, respectively, obtaining 
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where n = n^~\- = 0, 1, 2, 3, . . . and — 1, 2, 3,, . , Tills expression is completely 

identical with Eq. (2.50), which was derived for the case of circular orbits; therefore, 
the radiation frequencies remain the same as before. 

States with a given and different have the same energy, but differ in eccentric- 
ity c: 



In particular, when n^~n , we have a circular trajectory. As decreases, the 
elliptic trajectory becomes more elongated. 

The new feature to appear in the case of elliptic orbits is the selection rules = h 1, 
= 1, ±2, :1:3,,., . These account for the appearance of series that can also be 

found from the correspondence principle. In the case of circular orbits. An = An^ = ±; 1 
(see also Chapter 13), 


V Prob lem 2.2j Find the classical equaaon of motion and the trajectory of an electron 
according to relauvistic theory. Show that, in this case, the r and 9 
coordinates change with different frequencies; that is, the motion is quasi-periodic. 
Determine the angle through which the perihelion of the electron is shifted during '‘one'* 
revolution. Obtain a formula for the energy levels and find their splitting. Compare the 
results with the corresponding nonrelativlstic problem 2,1, 

Solution, Using the relativistic Lagrangian function 


where 


X~ — moc’‘ y I —[1^ 4 —I, 


(2.57) 






we obtain the equation of motion 

moV ^ Zef, 

dt “■ 


From Eq, (2,57) we determine the generalized momenta and 
Since p^ = r', where r' ^ , it follows, in accordance with the law of conservation 

of energy, that 




pi '/e^ 

mlc‘ 4-pJ 4 = con^.t, 


which implies that 




Hence we obtain the equation of the trajectory 


1 4- £ cos 79 * 


(2.58) 
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where 


_ 

ZeiE ’ 

,y:;3Ep. 


It is apparent from Eq. (2.58) that the motion is quasi-periodic. For the shift Ay of 
the perihelion, we have from (2.58) 

^ T 

With the help of (2.40) and (2.56) we get 


where 


« \ mlc^ j ’ 




ZelE 


r — ^ 

^—Po .a • 


Then, for the energy we obtain the expression 

(2.59) 


From this it is evident that the frequencies andoy— will be different. 

Using the Bohr quantization rule (2.47), we transform Eq. (2,59) to 

K \ ^ { 1 + j -— Y - -»„c^ (2.60) 

e'^ 1 

where a is the fine-structure constant. 

c/i 137 

Expanding the formula (2.60) into a series in and restricting ourselves to quantities 
of the order of we have 


H = uIqc'^ I + ‘ 











3\] 

4j|* 


(2.61) 


Since varies from 1 to n, it follows from Eq. (2.61) that the energy levels, which 
are determined by the principal quantum number « = are split into « closely 

spaced sublevels (this close spacing is a consequence of the smallness of a-). 
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llie fine-structure splitting is 

~\y 

The splitting, or fine structure, of the levels, is a characteristic result of relativistic 
effects and essentially distinguishes the predictions of relativistic theory from those of 
nonrelativistic theory (see Problem 2,1). 



Problem 2.3. jAn electron is located in a central field 

A 


V= 


A>0. 


Determine the values of s at which stable states of the system are possible. 
Answer, s < 2. 

Hint, Use the expression for tlie effective potential energy 

V ^ 

eff""2//ror3 7^' 

and make use of the fact that stable motion is possible if Keff has a minimum. 


lyblem 2.4j Find the scattering cross section for nonrelativistic electrons by nuclei 
(CoulofffflTWfflt charges). Compare the result with the scattering cross section for alpha 
particles. 

Answer. 


(h 




^roblem2j^ Show that in die relativistic scattering, unlike the nonrelativistic case, 
elecirJWfe caA be captured by nuclei. Obtain the total capture cross section. 

Solution, According to nonrelaUvlstic theory, the trajectory of electrons in the 
Coulomb field of a nucleus is given by the equation 


, ^ <7 _ . 

1 + ' t'OS cp » 


where 




moZ-e] 


At /• > 0, this equation describes hyperbolic motion, and therefore die capture of elec- 
trons is impossible. 

In relativistic mechanics, the equation of the trajectory has the form of Eq. (2.58), 
Z‘e\ 

^ *^his equation, 7 becomes imaginary. Therefore, r — Ofor 9 — 00 ; that is, 

the particle falls to the center, N\hich means it is captured. 

Since the angular momentum of the electron is related to the initial momentum p 
and the impact parameter b by the expression p^ — pb, the condition for capture can be 

Z'^e ‘ 

written in the form b^ = 5 .: , Hence the total capture cross section is 


®cap ^^max~ 





Chapter 3 


Wave Properties of Particles 

A. DE BROGLIE WAVES 


As we mentioned in Chapter 1, the development of modern 
quantum theory began with the discovery that light has particle 
properties in addition to wave properties (characterized by the 
wavelength X and the frequency w). The energy e and momentum Jt 
of a quantum of light (photon) were established by Einstein as 

£ = = 3t = M (3.1) 

Analyzing these equations, the French physicist de Broglie sug- 
gested that they could be generalized to apply to ordinary particles 
and, in particular, electrons. Generally speaking, de Broglie as- 
sumed that the wave-particle duality is not an exclusive property of 
light, but is also a characteristic of electrons and all other parti- 
cles.^ Accordingly, a beam of free electrons, whose relative energy 
E and momentum p are related to the velocity v by the equation ^ 


E:^ 


Y\-r 


nioV 


Vf- 


(3.2) 


should also exhibit wave-like properties. The corresponding fre- 
quency and wave number were defined by equations similar to 
Einstein’s: 


E = ho) and p = Ilk. 


(3.3) 


^This hypothesis was made by de Broglie with a twofold purpose; first, to provide a 
physical basis for the Bohr quantization; second, to explain the first experiments on elec- 
tron diffraction (see below). 

^From now on, we shall leave it to the reader to distinguish between the relativistic 
energy (which includes the rest mass energy) and the nonrelativistic energy. Only in 
cases in which the two energies appear in the same equation will they be distinguished 
by some kind of index, for example, 


E ^ e'+ niQC^ 
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Consequently, the de Broglie wavelength of the moving particles is 

(3.4) 

k p 

The de Broglie relations (3.3) thus generalized Einstein equations 
(3.1), derived from the photon theory. These now became equally 
applicable to the analysis of light in terms of its particles and 
of moving electrons in terms of their wave properties. It is worth 
noting that the dual character of particles and light disappears 
if Planck's constant “ft is allowed to go to zero (the correspondence 
principle). 

Taking de Broglie’s equations (3.3) as the basis of discussion, 
we may describe the motion of free particles (along, say, the x 
axis) by the so-called wave function, which for this particular case 
is analogous to that of light and represents a plane wave: 

<|.(x, = = (3,5) 

From the standpoint of Eq. (3.5), we can attempt to explain Bohr’s 
postulate of stationary states [see Eq. (2.47)]. The physical inter- 
pretation which we can offer is as follows : the only allowed circular 
orbits are those which are divisible by an integral number of 
de Broglie wavelengths, that is. 



Indeed, the wave function is single- valued only when this condition 
is satisfied. Furthermore, since in the nonrelativistic case 


Eq. (3.6) yields the Bohr condition for stationary states: 

p^ = rnQrv = Hn, (3.7) 


B. EXPERIMENTAL OBSERVATION OF THE 
WAVE PROPERTIES OF PARTICLES 

To investigate the wave properties of electrons and to prove 
that electrons have a specific wavelength X, it is first of all neces- 
sary to obtain a monochromatic electron beam (a beam of electrons 
moving with the same velocity). One possible way of producing 
such a beam is by means of an ‘^electron gun'’ (see Fig. 3.1), 
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which emits electrons with a certain definite velocity v. In the 
nonrelativistic case this velocity is given by the equation 


■ 2 300» 


(3.8) 


where is the accelerating potential (in volts) of grid A relative 
to the cathode C. Thus, after passage through grid A (at which 
point no further acceleration is imparted), the electrons will have 
a de Broglie wavelength of 

^ /t h y 150 1.2 • 10-^ 

rfhv y niaCoA} 


To exhibit the wave properties in the most pronounced form, 
one must impart to them the longest possible wavelength 
This can be done by decreasing <l>. Since, however, a certain 
amount of energy, known as the work function, is expended in the 
ejection of electrons from metal (this energy is of the order of 
several ev and gives rise to a certain spread in the electron veloc- 
ities), the smallest potential at which the beam will be relatively 
monochromatic is 15-20 volts. At these conditions, the de Broglie 
wavelength of the electrons is approximately thesame(X^ 10 * cm) 
as the wavelength of soft x-rays. 

Before we consider the physical nature of the wave- like charac- 
ter of an electron beam, let us discuss several experiments that 
have led to the direct detection 
of de Broglie waves. To begin 
with, we shall take the elec- 
tron diffraction experiments of 
Davisson and Germer , who were 
the first to make experimental 
observations of electron waves. 

Since the de Broglie wavelength 



of an electron is of the order 
of 10 * cm, they proceeded in 


Fig. 3. 1 An electron gun that produces a 
monochromatic beam of electrons. 


the same way as for soft x-rays, c is the cathode, and a IS the anode. 


using a crystal with a lattice 

constant of the order of 10 * cm for the diffraction grating. The 
set-up of the Davisson-Germer experiments is depicted in Fig. 
3.2. After coming out of the electron gun, the beam impinges 
perpendicularly on the surface of the crystal, where the electrons 


are scattered at various angles by the surface lattice. Since the 
penetration of the electrons into the crystal can be neglected, the 
diffraction grating can be regarded as two-dimensional. Accord- 
ingly, the position of the diffraction maxima can be determined 
from the condition that the path difference s==ds[n0 {d is the 
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two-dimensional lattice constant and t> is the scattering angle) is 
equal to an integral number of wavelengths (see Fig. 3.3): 




1.2 


n 




(3.10) 


where the integer n is the order of the given diffraction maximum. 
The diffraction maxima were detected with a galvanometer, which 
registered the intensity of the beam of scattered electrons at 
different angles (see Fig. 3.2). 


Electron 

gun 



Fip. 3 2 Diaf^ftim of the D«ivisson-Gernu‘r 
experiments on electron diffraction 


A second prediction of the de Broglie theory was also verified: 
namely, the prediction that as the potential increases, the angle 
corresponding to the nth diffraction maximum, will decrease in 
accordance with Eq. (3.10). Thus the correctness of de Broglie’s 

equations was completely confirmed by 
these investigations. 

It is well known that x-ray diffrac- 
tion patterns are produced not only 
by single crystals, but also by poly- 
crystalline formations. This was shown, 
for example, by Debye and Scherrer, 

Tartakovskiy and Thomson obtained dif- 
fraction patterns similiar to these x-ray 
diffraction patterns by extending this 

technique to electron waves and passing 
an electron beam through a foil.-* The 
following theoretical explanation can be 
given for the diffraction pattern of electrons. The electron beam, 
with an energy of several thousands or even tens of thousands of 

electron volts, impinges upon a polycrystalline foil, where it 



Fig. 3.3 Electron .scatlerini; 
by a two-dimensional diffrac- 
tion grating 


To reduce the absorption of electrons, a relatively thin foil (thickness of the order of 
10 cm) was used 
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encounters single crystals and is reflected from them. The path 
difference s of two rays (see Fig. 3.4) is related to the lattice 
constant doi the three-dimensional crystal by the relation 

s~2Jsincp, (3.11) 

where cp is the angle between the ray and the lattice plane. Since 
the single crystals of the foil are oriented at random, a ray can 
leave the foil at any angle with 
respect to the original direction 
(see Fig. 3.5). Among the crystals, 
there will be some that are oriented 
at just the right angle to satisfy 
Bragg’s law 

2^/sin (f =:/iX = /2 (3.12) 

where n is an integer. Whenever 
this is the case, a diffraction maxi- 
mum occurs and a bright spot Q is found on the screen. Since the 
experimental apparatus is cylindrically symmetric, the bright spots 
form diffraction rings, whose radiuses Rn can be found from the 
relation (see Fig. 3.5) 



f'lp. 3 1 Reflection from a three- 
dimensional crystal lattice 


tan 29 = f. (3.13) 

where L is the distance from the screen to the polycrystalline foil. 
Since the angle ^ is very small in these experiments (cp?=^sin 
tan cp), Eqs. (3.12) and (3.13) yield 


Rn 


L 1 2 • 10- ^ 


(3.14) 


that is, at constant L, d and n, 

/? . (/(I> = const. (3.15) 

These relations were completely confirmed by Thomson’s experi- 
ments. Electron diffraction patterns and Debye-Scherrer x-ray 
diffraction patterns are now widely used in studies of crystal 
structure. 

It is worth noting that de Broglie’s formula does not only apply 
to electrons and other elementary particles such as protons and 
neutrons, but also to complex nuclei, multielectron atoms, and 
even to molecules. True, their de Broglie wavelength is very small 
because of their relatively large mass. Nevertheless, Stern and 
Esterman have succeeded in observing the diffraction of beams of 
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helilun atoms and hydrogen molecules in reflections from LiF 
crystals. 


Polycrystalllne 



F'ig. 3.5. Diffraction of electron waves in a 
polycrystalline substance (the Tartakovskiy- 
Thomson experiments) 


A method involving neutron diffraction has been found extremely 
useful in analyzing the structure of substances. Neutrons have no 
electric charge and, therefore, pass freely through matter even at 
low energies (thermal neutrons), when their de Broglie wavelength 
is relatively large. All these facts provide convincing evidence 
that wave properties are displayed in some degree or other by all 
particles and that the de Broglie formula for X is of universal 
validity. 

The analogy between light waves and electron waves has led to 
the development of a new branch of physics, electron optics, which 
is devoted to the study of wavelike processes in electron beams. 
With the help of this new science, it has been possible to design 
and construct electron microscopes, which have found wide appli- 
cation in modern techniques. In ordinary optical microscopes, the 
upper limit of the resolving power (and, consequently, the magnifi- 
cation) is of the order of the wavelength of the light which is 
utilized. To achieve the highest possible magnification, it is neces- 
sary to reduce the wavelength of the light. The wavelength, however, 
cannot be made arbitrarily small: it is impossible, for instance, to 
construct an x-ray microscope, since no appropriate lens exists 
for x-rays. On the other hand, satisfactory electric and magnetic 
lenses can be produced for electron waves. Thus, the electric 
lens consists of a capacitor which has an aperture in the middle 
of the plate, while the magnetic lens consists of ordinary magnetic 
coils. 


4 

Modem optical microscopes give a magnification of approximately one or two 
thousand. An electron microscope can give a magnification of more than 100,000. Besides 
the electron microscope, the proton microscope is also widely used at present. Its re- 
solving power can be made even greater than that of the electron microscope. 
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C. WAVE PACKETS. GROUP AND PHASE VELOCITIES 


From de Broglie’s hypothesis it follows that the motion of a 
free material particle having an energy E = mc^ and a momentum 
p=zmv, can be described by means of the plane wave (3.5). The 
velocity u of the de Broglie wave can be found as the time rate of 
displacement of a constant phase 


Et — = const; 

that is, the phase velocity is^ 



(3.16) 


(3.17) 


According to the theory of relativity, the velocity of the particle v 
cannot exceed that of light in vacuum (c). However, the calculated 
phase velocity of the wave appears larger than c. This would indi- 
cate that it is theoretically impossible for a monochromatic wave 
to transport a particle or carry energy, since all particles and 
energy must travel with a velocity smaller than that of light, in 
accordance with the principle of relativity. 

®This equation can be obtained from the following simple considerations. The wave 

function depends only on the phase ch - (Et - px) therefore, at the time = t + At, 

h 

this phase moves to the point Xj - r f A.i:, which can be found from the equation 


Thus 


Eti - pxi - Et ~ px = constant . 


E A I - p A X = 0 


Hence the rate of propagation of the constant phase (which is also the velocity of the 
wave as a whole, as can be seen from Fig. 3.6) is 

Ajc E 
At p 



Fig. 3.6. The propagation of a monochromatic wave. 
During the time At, the wave as a whole moves a dis- 


tance A:jc. The phase velocity of the wave is 

At 
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An escape from this dilemma was found in the early stages of 
the development of quantum mechanics. This solution retained the 
wave properties of particles, which had received such striking 
experimental confirmation. Thus, each particle was associated 
with a group of waves of nearly equal frequencies, rather than with 
a single monochromatic wave. This was further suggested by the 
fact that the diffraction lines of electron waves were always ob- 
served to have a certain definite width. Thus, it seemed that several 
waves with very nearly equal frequencies were diffracted, rather 
than an individual wave. Yet another basis for this solution was 
provided by spectroscopic studies, which showed that all spectral 
lines are also characterized by a certain definite width. 

One advantage which results from using a set of waves with 
nearly equal frequencies rather than an individual monochromatic 
wave is that it is always possible to construct a wave packet whose 
resultant amplitude is appreciably different from zero only in a 
certain small region of space. This small region of space can be 
associated with the position of the particle. 

On the basis of these considerations, let us attempt to describe 
the motion of a particle by constructing a wave packet out of a 
continuous set of waves, assuming that the momentum //, which is 
connected with the wave number k' by the de Broglie relationship 

k'z=p'jn, ranges from p to (Ap c^p). We shall take the 

amplitudes A (p') of the individual waves to be 


A{p')-. 


0, 

0 , 


n’<p—Y> 


Ap^- 


p ^ p 


.Sp 

-f- — . 


P'>P + f. 


( 3 . 18 ) 


The resultant wave function is 


p-l- 


Ap 


-I ^ 


A - - p'A) 

ir^ " 

A/? 


dp'. 


P— -i 


( 3 . 19 ) 


In this equation, let us change from the variable p'to the variable 
p" -~p' — p'^—Y'-^P’ ’ ^P' = ^P’) expand E' in a series 

about the central point p' = p: 
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Restricting ourselves for present purposes to the first-order terms 
(terms proportional to and integrating Eq. (3.19) with 

respect to dp\ we obtain 


t D -P-') 

^ = Be ^ , 


(3.21) 


where the amplitude of the wave packet is 




Sill S 




(3.22) 


From this equation, it follows that B does not remain constant 
either in space or in time. 

To determine the velocity of the wave packet as a whole, that is, 
the group velocity Ti corresponding to the motion of some specific 
amplitude, we shall use the same procedure as for the phase 
velocity. Let us take a certain constant value of the amplitude such 
that 

( 3 . 23 ) 

We then obtain the following equation for ti: 

u = = (3.24) 

dt dp * 

Since £ = c: p'^ -f- for a free particle, and 

dE c-p 

dp~ E'’ (3.24a) 

we find 

= (3.24b) 

which shows that the group velocity li of the wave packet as a whole 
is exactly equal to the velocity v of the particle itself. 

- From the expression for the amplitude £(5) = i4-^^ of the wave 

packet at time t = o (when the amplitude corresponds to^ = ^x; see 

Fig. 3.7), it is readily seen that the maximum value of this 
amplitude B{0) = A lies at the point At all other points cor- 

responding to the relative maxima the amplitude is smaller. In 
particular, considering different values of the argument we have 


B 


I 

Bl- 




= ^A. 

5r. 
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At the point $ = the amplitude vanishes (5 (ir) = 0). Consequently, 
at t = 0 the wave packet may be regarded as localized in the region 

of the first maximum, that is, in the region A? Hence, 

we obtain the relation 

(3.25) 

which is known as the uncertainty principle. 



Fig. 3.7. Form of the wave packet at t = 0. 

Since the center of mass of the Wave packet (that is, its principal 
maximum) travels with the velocity of the particle (u=zv)^ the wave 
packet describes the localization of the particle. In particular, 
it follows from other h)q)otheses that have been made that the 
position of the particle is characterized by the square modulus of 
the amplitude of the x wave, namely. 

Consequently, the quantities A^irand Apmay be regarded as a meas- 
ure of the accuracy with which it is possible to compute the 
momentum and position of a particle in space by means of the 
wave theory. 

It was also necessary to determine whether ^ waves could be 
identified with the structure of particles, or whether these waves 
characterize only their motion. The first interpretation of the 
relationship between a particle and its associated wave was proposed 
by Schrbdinger. In terms of his hypothesis, a particle is a wave- 
like formation and the density of its “smearing out” over space is 
given by 
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In theory, a group of waves can always be used to form a wave 
packet whose size is of the order of the radius of any given particle 
(for example, an electron). This representation of the particle is, 
however, unstable. Indeed, as follows from Eq. (3.17), the phase 
velocity of each of the monochromatic waves contained in the wave 
packet depends on the corresponding wave number, or momentum 
p. Accordingly, each of these monochromatic waves propagates 
with its own phase ® 


w = — 

P P 

As a result, the wave packet gradually spreads out with time. The 
“spreading” time is determined by the time interval in which the 
initially disregarded part of the phase of ^ in (3.21) becomes com- 
mensurate with 'IT. According to (3.20), the neglected part of the 
phase is proportional to p'^^(Ap)^ Thus, from (3.20) we can obtain 
a measure of the time M which elapses between the initial forma- 
tion of the wave packet and its distortion: 


A/. 


(Spy d^E 
2n * dp'^ 


TC. 


Or, the time interval starting with the formation of the wave packet 
and ending when the distortion of the latter can no longer be 
neglected is 




2izfl 


d^F • 


(3.26) 


Let us replace ^p by its value from (3.25). Then, in the non- 
relativistic case (p<moc), Eq. (3.24a) yields 

dp mo* 
dp^ mo ’ 

Thus we obtain for the “spreading” time of the wave packet 

(3.27) 


can be seen from this that a stable wave packet can be formed only for a particle 
which has zero rest mass (niQ = 0) and which is moving in a vacuum (as an example of 
such a particle, we may take the photon). It is only in this case that the phase velocities 
of all the components of the wave package are the same regardless of the wave number. 
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For a particle with m„^lg and ^x^^0A cm, the ‘‘spreading” 
time is A/ ^10^® sec, and thus the wave packet does not actually 
spread. In the case of electrons, however, g and 

Ax^lO ** cm. Therefore, the wave packet of an electron begins 
to spread out after A/--v>10 sec, that is, almost instantaneously. 
Thus, the electron is not a stable formation in the Schrbdinger 
theory of the “smeared” electron. This is in obvious contradiction 
with the experimental facts. Moreover, it becomes impossible to 
explain the phenomenon of diffraction if a monochromatiq wave, 
which provides an appropriate description of the motion of several 
electrons, is replaced by a set of several wave packets. 

At present, another approach has been adopted, namely, Max 
Born’s statistical interpretation of the wave function (the quantity 
I as the probability density, or probability of finding the 
electron at various points in space. The statistical interpretation 
is not concerned with the structure of the electron, and the electron 
can remain a point charge (or, to be more precise, a charge whose 
radius does not exceed 10 cm). Only the probability of finding 
the electron at different points in space changes as the wave func- 
tion changes with time, but the structure of the electron remains 
completely unaffected. 

From the standpoint of Born’s statistical interpretation, the 
quantum-mechanical treatment of problems involving many elec- 
trons in identical states presents certain methodological similarities 
with the treatment of various problems in the kinetic theory of 
gases on the basis of Maxwell’s distribution function (it should be 
noted, however, that the quantum-mechanical distribution function 
f = devoid of the temperature term). For example, in Born’s 
interpretation, the diffraction pattern of an electron beam may be 
explained in the following manner. The bright spots correspond to 
the maxima of the function f = l^ and thus the greatest numbers 
of electrons travel towards bright spots. On the contrary, the 
probability is smallest for electron motion in the direction of dark 
regions. 

One attempt to explain a certain “freedom of behavior” of an 
individual electron is based upon the complementarity principle, 
a solution adopted by Bohr, Heisenberg, and others (see also 
Chapter 7). The complementarity principle asserts the theoret- 
ical impossibility of extending our knowledge of the microscopic 
world beyond a certain finite, even though small, limit of accuracy, 
since our measuring apparatus necessarily must exert certain 
indeterminable effects on the experimental object (an electron, 
for example). In particular, for canonically conjugate quantities, 
such as the position and momentum, the degree of accuracy 
with which they can be measured is given by the uncertainty 
principle. 

Heisenberg attempted to provide a more rigorous foundation for 
the complementarity principle by means of the following hypothetical 
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experiment. Suppose we wish to determine the position of an elec- 
tron with the aid of an “ultramicroscope,*' that is, an instrument 
that is designed for precisely this task and that utilizes a light 
beam of appropriate wavelen^h X (it is not actually possible to 
construct such a microscope). We shall assume that the electron 
is located at the vertex of a cone of revolution with angle 2f, (the 
objective being the base), where (fc=.the angle between the incident 
(wavelength X) and scattered light beams. Any light that enters 
the objective after being scattered by the electron must have 
traveled within this cone of directions. From the laws of optics, 
it follows that the uncertainty in the determination of any of 
the electron coordinates in the plane parallel to the plane of the 
objective is 


(3.28) 


Moreover, since the light has a momentum p==j, part of this 

momentum is imparted to the electron (the Compton effect). Ac- 
cordingly, the momentum of the electron in the given direction can 
only be determined with an uncertainty of 

^Px ^ Y 2 ^) 


The product of these two uncertainties yields the uncertainty 
relation. 

From (3.28), it is seen that to determine the position of the 
electron with the maximum degree of accuracy, it is necessary to 
use light of the smallest possible wavelength X. Equation (3.29) 
shows, however, that the smaller the wavelength, the greater is the 
momentum imparted to the electron (the position and momentum 
are canonically conjugate quantities). Thus, according to the com- 
plementarity principle, there would have to be two classes of 
experimental aparatus. One of these would be designed to measure 
the spatial coordinates with any desired degree of accuracy (in the 
case of “ultramicroscopes,** instruments with X->0), the other, to 
measure the corresponding momenta (instruments with X-^r^), 
Then the observer using an instrument of the first class imparts 
all of the indeterminable effects to the momentum, whereas the 
instrument of the second class imparts all of the indeterminable 
effects to the position. Accordingly, in the opinion of the adherents 
of the Copenhagen school, we cannot at any given time ascertain 
both the position and the momentum of the object, although each of 
these variables can be measuredseparately with any desired degree 
of accuracy. 
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VProblem3j^ Show that there is an integral number of de Broglie wavelengths in an 
emp!R^!ec8?8!lftrblt. 

SoluUoiu According to Bohr, the set of stationary elliptic orbits may be found from 
the two conditions 


where 


§ 


pr dr = rifh 


and 





■ df 


and 






Since 27'= that is, 

ft 

T “C 

2f Tdt='^^pi,^^dt=^jipi,dqi, = nh, 
Q kb k 


where 


n 


n® + «r. and 


2T dt = mov ds^-^ ds, 


we obtain the required result 



= ?i. 


Problem 3^2 j Show that the wave 


V (a) ^ 


giffQ X 
0 


I A i < 2 » 
|a!>”, 


is not monochromatic for finite values of /, Find the range of wave numbers Ak~k—ko 
over which the amplitudes of the individual harmonics may be regarded as nonzero. 
Solution, Let us represent ^ (x) in the form of a Fourier integral 


oo 



— oo 


where the amplitudes /4 (/f) are given by the formula 

4-00 

/!(*) = 4' \ <((x) (-‘"’^dx. 

00 

Substituting the expression for y (x)^ we find 

sin^ {k—'ko) I 

Hence it follows that the largest amplitude corresponding to ife = /fo is /I (ko) == l/2n. 
Although all the other amplitudes are not equal to zero, in practice only those amplitudes 
which are of the same order as A (ko) need be regarded as nonzero. These amplitudes 

lie within the range ^k=i\k — I ~ y . The quantity Aft 2«// may be regarded as the 
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* ‘spread'* of the wave vector due to the finite width of the wave packet. Setting 
and / ^ Ax, we again obtain the uncertainty relation: 

A/? Ax -- h. 

Find the mean velocity and the ‘‘spreading** time of the wave packet 

00 

<f (x, 0 = J /i (ft) e- dk (3.30) 

— oo 

if the amplitude A (k) is in the form of a Gaussian curve: 

_ ~ 

A{k) = e ' 



n * 


Solution, 
f or f = 0 


Substituting the expression for A{k) into Eq, (3.30) and integrating, we find 


It follows that the particle is initially localized in the region Ax 


— . This in effect is 

q 


equivalent to the uncertainty relation (3.25). 

To obtain the shape of the wave packet at any other instant of time, we take into account 
the equation 




In the nonrelativistlc approximation (ft < moC//i), we then have 

(0 V- A ' — , 

ti ^ 2mo 

Substituting this expression into Eq, (3,30), we obtain, after integration 

_ ^ mn ' 

9 . 






4- i {kox — tuoO 




where 


Woc* , fikf, 

“»=“r + 2m;- 


Hence we have for the probability density 

01^ = 


nq^ 


(X — T>o/)« 


Z'+l'S' 


where Oo is the velocity of the center of mass of the wave packet. From this formula 

ITta 

it is seen that the maximum of the wave packet, that is, the point x = Oof, moves with the 
velocity Vo of the particle. 



56 


NONRELATIVISTIC QUANTUM MECHANICS 


The effective width of the wave packet at time t is found to be 


If we substitute i into this formula, the "spreading" time of the wave packet is 

i\X 

expressed by a relationship which coincides with Eq, (3.27), 

Show that a damped oscillatory motion 
X (t) = cos 



results in broadening of the spectral line. 

Solution, The damped oscillation can be represented in the form of a Fourier integral 

oo 

x(l) = J a (oi) cos d(ji, 

0 

where 


a ((o) — 


2 

TZ 



COS dt = 


1 


" ■(“ + (“ 




For the largest amplitude, we obtain 


The amplitudes with the same order of magnitude correspond to frequencies lying in the 
range 


A(i) = I (i> — I y. 

It is this equation which represents the line broadening since, in effect, only the 
amplitudes corresponding to these frequencies differ from zero. 

Since the damping coefficient 7 is connected with the mean life of the damped oscilla- 
tions by the relauon At I/ 7 , the above equation yields the familiar optical relationship 
connecting the broadening of the spectral lines and the mean life of the atom: 

At A(t) ^1. 

Hence, since Aw — A/://i,we obtain what is known as the fourth uncertainty relation 


AEAt h. 



Chapter 4 


The Time-Independent Schrodinger 
Wave Equation 


Planck’s quantum theory, Bohr’s postulates, and de Broglie’s 
hypothesis represented very important steps in the development of 
the theoretical foundations of atomic physics. However, they were 
overshadowed by the discovery of a fundamental differential equa- 
tion describing the electron and accounting for its wave properties, 
and the construction of a theory accounting for the quantum nature 
of radiation. The crucial move in this connection was made by 
Schrodinger in 1926, when he proposed a partial differential equa- 
tion that turned out to be generally applicable to the motion of 
charged particles in the nonrelativistic case (v %^ c). This equation 
represented a generalization of the classical Hamilton-Jacobi 
equation to cases in which the de Broglie wavelength differs from 
zero. The Schrodinger equation stands in approximately the same 
relation to the Hamilton-Jacobi equation as does wave optics to 
geometrical optics. 


A. DERIVATION OF THE TIME-INDEPENDENT 
SCHRODINGER EQUATION 

We shall show how the Schrodinger equation can be obtained 
most simply. We must insist that there can be no question of a 
rigorous or general derivation of this equation, since it is not, in 
general, possible to set up a new theory entirely on the basis of 
old postulates. We shall adopt a mode of presentation which con- 
sists essentially of a reasonable generalization of the wave equation 
from classical electrodynamics or optics 

V V (r, t)- - JO = 0 (4.1) 

to the case of de Broglie waves. Here is a function describing a 
wave disturbance propagating with velocity u. If the wave is mono- 
chromatic, a solution to Eq. (4.1) may be sought in the form 




(4.2) 
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where a) = 2Kv is the angular frequency, and the spatial part 4^(r)of 
the wave function satisfies the equation 

mr) + ~A(r) = 0. (4.3) 


In this equation, we can use a single parameter in place of the two 
parameters o) and u, namely, the wavelength 


X 


2nu 

ca 


(4.4) 


We then have 


= (4.5) 

From this general equation, we can obtain a wave equation describ- 
ing the wave motion of electrons by substituting the de Broglie 
wavelength 


A 


j __2iin 

tlldV p * 


(4.6) 


Using the law of conservation of energy 

-f V (r) = £ == const , 


we have 


4r.* 




(4.7) 


Substituting this expression into(4.5),weobtain the fme-inde/>endent 
(or stationary) Schrbdinger equation 

^*<K/')+^[£-V'(/-)]-)'(/') = 0. (4.8) 


Once we have found the space -dependent part (|> (r) of the wave func- 
tion from (4.8), we can use Eq. (4.2), which is valid for all mono- 
chromatic waves, to obtain the complete wave function, which 

depends on both (spatial and time) coordinates. Substituting w = 

we have^ 


( 1,(0 =8 ^ f 


(4.9) 


^From now on, we shall write the wave functions which depend on both position and 
time in the form while the wave functions whose only argument is the position will 

be denoted as i/j. 
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For the complex conjugate function, whose space-dependent 
part also satisfies Eq. (4.8), we have 


E 

r(t) = e^ 


«j>*. 


(4.9a) 


B.,>/^ESTRICTIONS ON THE WAVE FUNCTIONS. 
EIGENVALUES AND EIGENFUNCTIONS 


The functions ^ (t) which describe the behavior of a particle may 
be statistically interpreted by means of the Schrodinger theory. In 
particular, the quantity which plays the role of a 

distribution function, represents the probability density^ or prob- 
ability of finding the particle at any particular region in space. If 
the probability density differs from zero only in some arbitrarily 
large, but finite, region of space 2, it is accurate enough to say 
that the particle is located somewhere in this region. In other 
words, the probability of detecting the particle in the region Q is 
unity. Mathematically, this can be expressed in the form 



(4.10) 


In quantum mechanics, relationship (4.10) is called the normaliza-- 
tion condition. 

It is important to note that the region of nonzero probability 
density is not always finite. There are cases where does not 
go to zero over all of space (the simplest of these cases is that of 
the motion of a free particle, which we shall consider below). When 
this happens, diverges, and a somewhat different formula- 

tion of the normalization condition must be given. 

We shall now give a general analysis of the Schrodinger wave 
equation. The Schrodinger equation (4.8) is a second-order partial 
differential equation. Its solution resembles the solution of certain 
classical problems of mathematical physics, such as the problem 
of a vibrating string. We note, first, that certain conditions 
must be imposed on the wave function since it is a solution 
satisfying a second-order Sturm-Liouville equation. It must be 
continuous and have a continuous derivative. Moreover, it must be 
single valued and finite over all of space. Finally, it must satisfy 
certain boundary conditions. In general, solutions whi ch s atisfy 
these requirements do not exisTIor all values of the parameters, 
but only for certain specific values, which are known as eigen- 
values. In the case of the Schrodinger wave equation, the energy 
£■ is a parameter of this sort, its eigenvalues being 


El, Ea, Ej, . . . . 


(4.111, 
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The solutions of the wave equation corresponding to these eigen- 
values 


h* ••• (4.12) 

are said to be eigenfunctions. The possible values of the energy 
form the energy spectrum. We shall see below that, if the motion 
of a particle is not bounded, its energy spectrum is continuous. 
If, however, the position of the particle in space is bounded, the 
energy spectrum is discrete. 

Let us show that the eigenfunction satisfy the orthonormality 
condition 


J = Kn'* (4. 13) 

where is the Kronecker-Weierstrass symbol, which is equal 
to unity for (the normalization condition), and to zero for 

n' An (the orthogonality condition). To prove Eq. (4.13), we write 
the Schrddinger equations for and 

^^%An:(En-V)^n = 0, (4.8a) 

^^rn' + -nHEn'-V)rn' = 0. (4.8b) 

If we multiply the first of these equations by and the second 
and then add the two resulting equations, we obtain 

(/■„ - £„.) = U. (4.14) 


Since 


v-B, 

where 


integration of Eq. (4.14) over all of space yields 

J V . B + ^ (£„ - En-) J = 0 

Since the I function tends to zero at infinity, we obtain^ 
j V ■ B(Px=(jB„dS = 0, 


(4.15) 


^The surface S - 47rr^ tends to infinity at r > oo. Therefore, the integral goes to zero at 
/• -> fxj if the wave function l// tends to zero more rapidly than r~^. This condition is always 
satisfied for a discrete spectrum since the wave function, as a rule, approaches exponen- 
tially zero at infinity. The case of the continuous spectrum will be considered separately. 
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Therefore, instead of Eq, (4.15), we have 

(£„ - = 0. (4. 16) 

We shall now assume that E„~/zEn' (that is, n'-y^n). According to 
Eq. (4.16), the following equation then holds (orthogonality condi- 
tion) : 


^ (4.17) 

If, however, n' = n (or En = En ), this integral does not go to zero 
and we may impose the requirement that it be equal to unity (the 
normalization condition): 

(4.18) 

Thus the eigenfunctions <l>i, ^^2 and which correspond to the eigen- 
values El, Ei andEs, do indeed possess the property of orthonormality 
(4.13). This is one of the most important characteristics of eigen- 
functions. 


C. A PARTICLE IN A POTENTIAL WELL 

As an example of the calculation of eigenvalues and eigenfunc- 
tions, we shall consider the motion of a particle in a potential well. 
Since the chief interest of this problem is simply that it provides 
an illustration of methods used in the solution of this example, we 
may assume a very simple dependence of the potential energy on 
distance (see Fig. 4.1): 


V(x) = 


V„ for 
0 for 
V„ for 


-;o<.v<0 (region I) 

0 (region II) 

1 <1 -v < CO (region III) 


(4.19) 


In the potential well (region II) , where E'^V =0,the Schrodinger 
equation takes the form 


where 


and 


Am 





(4.20) 


(4.20a) 
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We note that the case has no physical meaning in this problem. 
Since the general solution of Eq. (4. 20) is oscillatory, we have 

(|)n z= iSii cos + 4 11 sin kx, (4. 21) 


In regions I and III, the Schrodinger equation has the form 

= (4.22) 

Here two cases must be distinguished. In the first case (E^V^o), 
the solution for these regions is also oscillatory in character (an 
equation of the elliptic type). It is given by Eq. (4.21), the value 
of k being 

k=-LY2m,(E~V,). 

No restrictions need to be imposed on the wave functions at infinity. 
Therefore, the energy E can assume any value in a continuous 
spectrum of energies. It is better, however, not to investigate the 
case of a continuous spectrum on the basis of this example, but 
rather on the basis of the motion of a free particle (see below). 
The potential well only adds to the mathematical difficulties of the 
problem, without changing the general character of the solution. 


V(x) 



Fig. 4.1. The motion of a particle in a 
potential well. 


In the second case, namely, the case of a potential barrier 
(£<Vo)» the solution of Eq. (4.22) is exponential in character (an 
equation of the hyperbolic type). The general solution can be 
written in the form 


^1. Ill = 4 1. 111 ^’^ + IlIe''**^ 


(4.23) 


where 


x‘=^2m.(Vo-£) = -!^>0. 


(4.24) 
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If the energy can assume any value without restriction, the wave 
function inside the potential barrier (0 <;£<^l/o) will contain both 
an exponentially increasing part and an exponentially decreasing 
part (see Fig. 4.2). Therefore, we must choose only those values 
of E for which exponentially increasing solutions do not exist inside 

Exponentially 



Fig. 4.2. Wave function for a given value of E. 
The energy level is taken to be the abscissa of 
the wave function 


the potential barrier. Accordingly, we require that the coefficient 
£1 = 0 in region I (-v<0),andthe coefficient Am =0 in region III 
We then have 


= (4.25) 

where, for the sake of simplicity, we have made 




By joining the solutions'^ at the boundary of regions I and II (a: = 0), 
and also at the boundary of regions Hand III (x = l), and making use 
of the requirement that the exponentially increasing solution vanish, 
we obtain the equation for the eigenvalues of the energy £. 

We shall now further simplify our problem by requiring that 
Fo, together with x, go to infinity (see Fig. 4.3). It is apparent from 
Eq. (4.25) that (J>i=:i))m = 0, and therefore the boundary conditions 
for the solution (4.21) inside the potential well (region II) take the 
form 

^11 = 0 for x = 0 (4.26) 

and 

t|;n = 0 for x = l. (4.27) 


^When E < Vq, the number of unknown coefficients in the wave function is smaller 
than the number of imposed conditions. Accordingly, solutions are possible only for certain 
values of E and a discrete spectrum is obtained. 

^In joining the solutions, we must match the actual wave functions (and also their 
first derivatives) at the appropriate point. 
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Applying Eq. (4.26) and (4.27) to the general solution (4.21) in 
region II, we find that Bn = 0, and the eigenvalue are described by 
the equation 


sm/e/==0, (4.28) 

from which 

kl = nn, (4.29) 

where n = 1, 2, 3, 4, ... . We exclude the value n=0 from further 
considerations, since the wave function in this case is identically 
equal to zero. It is not necessary to consider separately the nega- 
tive values of n, since the wave functions for negative n are equal 


/~0 



Fig. 4.3. Particle in a potential well with 
infinitely high walls. 


to the wave functions for positive n, taken with the opposite sign. 
Since = we obtain the following equation for the energy 

spectrum (the eigenvalues); 


E^ = 




(4.30) 


The wave functions corresponding to these values of energy (eigen- 
functions) are 


= A . (4.31) 

The coefficient A„ can be found from the normalization condition 

I / 

^ '!fldx= A'„ i|^ sin® nt j dx—-~ Al — 1 , 

0 0 
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which gives 


^.=1/4. 

Substituting the expression for into Eq. (4.31), we finally obtain 

^n= ]/" -jsinu/i (4.32) 

According to the general theorem of eigenfunctions [see Eq. 
(4.17)] the eigenfunctions (4.32) of the Schrodinger equation satisfy 
the orthogonality condition 


= 0 for (4.33) 


as can be readily seen by performing the direct integration after 
substituting Eq. (4.32) for 

We shall now write down a few specific eigenvalues En and 
eigenfunctions shown in Fig. 4.3: 


p 

* ‘2m„P 

> 'I'l = ]/^7 sin ^ , 

(4.34) 

E, = AEi, 

|/'7sin2iry, 

(4.35) 

Et = 9Ei, 

jsu\3r.j. 

(4.36) 


These solutions are very similar to the familiar standing-wave 
solutions for a vibrating string with fixed ends. The case n = 1 
[see (4.34)] corresponds to the fundamental mode, the case n = 2 
[see (4.35)], to the first harmonic, etc. 


D. THE MOTION OF FREE PARTICLES. 
NORMALIZATION OF WAVE FUNCTIONS IN THE CASE 
OF A CONTINUOUS SPECTRUM 

We shall consider the motion of a free particle (taking the one- 
dimensional case first), when the Schrodinger equation (4.8) has 
the same form in all of space ( — od<^a'<[oo): 






(4.37) 
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where 


= (4.38) 

The solution of this equation is 

+ (4.39) 

To determine the physical meaning of each term in Eq. (4.39), we 
write the complete, time-dependent function 


= + (4.40) 

It is seen that the first term describes the motion of the 

particle in the positive direction of the x axis, and the second term 
the motion in the negative direction. If we restrict our- 
selves to the traveling wave which is propagated in the direction of 
positive X, we have 

= (4.41) 


It is readily seen that the integral ^ diverges. Con- 

— oo 

sequently, we must revise the method of normalization [see Eq. 
(4.10)]. There are two basic methods of normalizing ^functions of 
this sort. We shall devote most of our attention to one of them, 
which was proposed by Born. Thus, rather than imposing a bound- 
ary condition, we shall subject the wave function to a periodicity 
condition 


1'W = 'l'(A^ + ^). (4.42) 

where the length L is said to be the period. It can be made 
arbitrarily large. As a rule, L does not appear in the final result. 
Rewriting (4.42) in the form 


we obtain 1, which implies 


where /i=0, ± 1, 
energy levels are 


k=J^, (4.43) 

:2, its From Eqs. (4.38) and (4.43), the 


r 




2/^0 


(4.44) 
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Since the ^ function is periodic in the interval L, the normalization 
condition becomes 


I ^ 

-L/3 


1 . 


(4.45) 


Substituting the expression (4.41) for i]i, we have 


A = 


VI' 


(4.46) 


Therefore, the normalized solutions are 


1 .2«/» 

% = L ^ = e . 


(4.47) 


The direct integration shows that the functions (4.47) are 
orthonormal. As can be readily shown by direct integration: 


for nf n, (4,48) 

for n' = n. 

Therefore, introducing the artificially defined period, we can 
transform the continuous energy spectrum into a discrete one. 
If, however, the length L is allowed to go to infinity, the discrete 

spectrum again becomes continuous. Since = and A/i=l, 

the energy difference between two neighboring levels is 


1/2 LrA ^ ^ 

-. 1/2 - 1/2 

sin 7c (/I — n') 

— “'ll 


Wo L L 


(4.49) 


It immediately follows that AE—*0 if L-^oo; that is , the energy 
spectrum becomes continuous. 

We shall now generalize the free particle problem to the three- 
dimensional case. The Schrodinger equation can be written in the 
form 



(4.50) 


Just as in the one-dimensional case, the quantity is given by 
Eq. (4.20a). We shall solve Eq. (4,50) by the method of separation 
of variables; that is, we assume a solution of the form: 


(4.51) 
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Substituting this expression into (4.50) and multiplying by 

1 1 


we find 


r(x) I u2 


0 , 


(4.52) 


where the primes denote the derivatives of the corresponding']^ 
function with respect to its argument. We note that an equation of 
this type is satisfied only if each term (fraction) is independent of 
the coordinates and is equal to a certain constant. We thus obtain 
the following equations for the functions ^(a:), ^(ij) and^J^(^): 

f ' (X) -I- //;6 {X) = 0. f ' (y) + {y) = 0, 

f'(2) + *:'Kz) = 0, (4.53) 

where 

k] (4.54) 


Taking a traveling wave propagating in some specific direction as 
a solution for each of Eqs. (4.53), we get 

(j> (a:) = ^{z)=Ce^f^AZ, (4.55) 


The unknown coefficients A , B and C are determined from the 
normalization condition, assuming that the functions $ (a:), ^ {y) and 
(^z) are periodic in the interval L. We thus obtain 

_ 1 _ I 

tp(ji[:) = L ^ (y) = L ^ (4.56) 

_ I 

([) ( 2 ) = L 2 gfM, 


where 


and 


t 2ji , 2it , 2n 

k\ — nx^, — = 


III, lit, rtj = 0, ±1, it 2, ±3, 
In this case, the particle has energies of 




titoL^ 


(4.57) 


(4.58) 


where 


n'* = nl + n5 + n=. 


(4.59) 
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Substituting the functions (4.56) into Eq. (4.51), we have 

_ 3 

^l^ = L 2 g/ft r (4.60) 

These wave functions will satisfy the orthonormality condition 

The complete, or time-dependent, wave function can now be written 
in the form 

^{t) = L {t/tl){Et-p.r)^ (4.62) 

where 

p==hk, E = £-. (4.63) 

We note that the energy spectrum is again continuous, just as 
in the case of one-dimensional motion. This can readily be shown 
in the same way as before by finding the energy difference between 
two adjacent levels and then letting L go to infinity. 


E. FUNDAMENTAL PROPERTIES OF 6 FUNCTIONS. 
6-FUNCTION NORMALIZATION OF THE WAVE FUNCTION 
IN THE CASE OF A CONTINUOUS SPECTRUM 


3 OF A CONTINUOUS SPECTRUM 

^ lRrA%cia 

one oT the fundamental properties of eig 


As we have already mentioijprffone oT the fundamental properties of eigenfunctions is 
their orthonormality [see Eq. (4,13)], An arbitrary function /’(.v) with no “special** 
properties can thus be expanded in a series of orthonormal functions belonging to a 
complete set: 




(4.64) 


Multiplying this equauon by (.\) dx and integrating over the entire range of x^ we 
obtain 


J (.V) F(.x) dx = 2 5 'fn (A-) dx. 


Because of the orthonormality condition, we have 


I 

t 


^ (■'■) f-' (■'■) dx. ( 4 . 65 ) 

In order to bring (x) under the Integral sign, we change here the variable of integra- 
tion .V to a' and substitute the expression for into the expression (4.64): 


n 


(4.66) 
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This expansion is a generalized Fourier series. We can obtain the ordinary Fourier 
series from it by substituting the harmonic functions (4.47) for (x). 

We note that the sum over n in Eq. (4.66) is a 6 function: 


= (4.67) 

n 

since it transforms the function F(x') into F(x)^ 

In this respect, this 8 function resembles the Kronecker-Weierstrass symbol, which 
possesses a similar property with respect to the subscripts n and n': 


Yf * 

n* nn* 

n' 


==/n. 


It follows that the function differs from zero only in a very narrow region x' ~ x^ just 
as the I symbol differs from zero only for n' — n. By setting F(jc') = I in (4.66), we obtain 
one of the fundamental properties of the 6 function: 


\ 6 (x — x^) dx* = 1 . 


(4.68) 


It is interesting to note that the relations (4.66) and (4.68) are completely independent of 
the particular set of orthonormal functions used to construct the & function. In the simplest 

case, the & function can be constructed from the 
orthonormal functions (4.47), which are used in 
expansion into a Fourier series: 



Fig. 4.4. Graph of the “smeared" 

5 function. 

If we now let L go to Infinity (Lk - 


1 V ^ 


(4.69) 


2nn 


Introducing a new variable ^ = and making 


Ai. 2 k. 2n 

ak -j- iin ^ 


use of the relation 
A/i = 1), we can transform (4.69) to 
i{x 

* 0), the last sum is changed into an integral: 


(since 


a:') = **-•*'*. 


8 (x - x") = 2 *^- ^ dke'" <* - •'’> . 

The range of integration over k should be taken from — oo to + oo. We note that the & 
function belongs to the class offunctions that are known as “improper functions.** This is 
manifested in the impossibility of direct evaluation of the integral (4.70), since it diverges 
at jc = x\ Tlierefore, if we wish to obtain a representation of the function, we have to 
“smear out** somewhat the Integrand of (4.70), For instance, the 8 function may be 
‘'smeared out** as follows: 


6(a; — a:', a)= ^ | + 

OO 

f e'“*co3ft (AT — Ar’)rfft = — 

" J ' ' It a*4-(A:— At')** 


(4.71) 


A graph of the smeared 5 function is given in Fig. 4.4. At the limit a — ► 0, the function 

must have the following properties: 
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8 (jc — jc') = oo for X = x\ 

8(a:-x') = 0 for (4.72) 

As we approach the limit a 0, the area between the 8 function and the x axis remains 
unchanged: 

(4.73) 

The three-dimensional 8 function can be similarly defined as 

» (r) = J (^) i (y) i (z) = J . (4.74) 

It satisfies the following conditions: 

8(r) = 0 if r 17 ^- 0 , 

^t(r)d>.v=l. (4.7S) 


The integrals (4.74) and (4,75) are three-dimensional. The former extends over the 
entire wave-number space (Af), and the latter, over the entire coordinate space (r). 

It is very convenient to express the density of a point charge with the help of a three- 
dimensional 8 function; 


P (r) — e6 (r). 

Substituting this expression into Poisson's equation 

V'" <I> = — 4)t(>s (r) = — J 
we can determine the potential of a point charge ' 


(4.76) 


2r.* J /f-* nr J 


,, sin kr e 

k r 


(4.77) 


Thus, the three-dimensional 8 function enables us to describe the singularity at the point 
r — 0 of the Laplacian operator applied to I /r: 




(4.78) 


In determining the potential we made use of the rule for dividing an exponential 
function by the Laplacian operator; 


-Ar* 

The validity of this rule is immediately obvious, since 


e 


ikr 




e 


V2(V=)-‘e'*' - 


,ikr 
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We shall not discuss the properties of the S function any further and shall proceed to 
use It to normalize the continuous spectrum for one-dimensional free motion. From 
(4.41), the wave functions for this case are of the form 


<|/ (p) = Ae^ , I* (/>') = Ae ^ . (4.79) 

The normalization of these functions is related to the ftfuncUon in the following way: 


^ (p’) (p) dx = A‘ ^ dxe"’^'' " = 

— OO 

= A'^'inhi {p —/>’) = a ip —p'). 


(4.80) 


From this, we obtain 




_1 

Y'lTth * 


Iherefore, 


1 ^ 

= e " . (4.81) 

y 2-Kh 

Let us compare the ordinary normalization of wave functions with the a function 
normalization. We can write the orthonormality condition for functions normalized by the 
ordinary method as follows: 





if n lies inside the range ni< n 2 , 
if n lies outside the range th < ih 


(4.82) 


Similarly, replacing the sum in Eq. (4,82) by an integral, we can obtain the following 
generalized form for wave functions which are normalized with the help of a a function: 


Pi 

^ dp' ^ |»(/V) 'ii(p)dx = 
Pi 


1, if /I lies Inside the range /ii < pj, 
0, if p lies outside the range pi P 2 , 


(4.83) 


Analyzing the solutions that we have obtained for a particle in a potential well and for 
a free particle, we arrive to the following conclusions. If the condition V > /: is satisfied 
at all points of space at infinity, the energy spectrum is discrete. If, on the other hand, 
there are regions at infinity in which V < 11, the energy spectrum is conunuous. While 
this conclusion was obtained on the basis of solutions for a rectangular potential well, it 
is quite general as long as the potential energy is a conunuous function of .v, _v and 2 , 



Chapter 5 


The Time-Dependent Schrodinger 
Wave Equation 


It has been shown that the solution of the time- independent or 
stationary Schrodinger wave equation (4.8) amounts to a determina- 
tion of eigenvalues (the spectrum of energy levels) and eigen- 
functions. The time-dependent wave function of a given state 
was found by multiplying hy e ~ The function tj>„(0 obtained 

in this way describes only strictly monochromatic processes (that 
is, with only one value of energy). There is, however, a more 
general form of the Schrodinger equation, which depends explicitly 
on time and may be used in a much larger class of problems. This 
equation is known as the time-dependent Schrodinger wave equation. 


A.*-TRANSITION TO THE TIME-DEPENDENT SCHRODINGER 

WAVE EQUATION ^ 

To obtain the time-dependent Schrodinger wave equation, it is 
necessary to eliminate the energy E from the time- independent 
equation, where it appears as a constant parameter. The time- 
independent Schrodinger wave equation (4.8) can be written in the 
form 


+ (5.1) 

The relation that is used to eliminate E from this equation is 

Accordingly, the time -dependent Schrodinger wave equation is 

This equation is more general than the previous one. In particular, 
it can be used for the description of processes in which the poten- 
tial energy V is a function of both position and time. 
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If the potential energy 1/ does not depend on time, it is necessary 
to solve only the time- independent Schrodinger equation by finding 
all the possible energy eigenvalues En and the corresponding 
eigenfunctions The wave function which satisfies Eq. (5.3) is 
related to these partial solutions by the linear equation 

(5.4) 

n 

To prove this, we substitute (5.4) into (5.3), remembering that 
are constant coefficients and that satisfies the equation 

+ = (5.5) 

It is then readily seen that 'KOis a general solution of (5.3), since 

lEJ 

= 2^"® " (5.6) 

n 


The case of a monochromatic wave is a special case of the general 
solution (5.4). The appropriate wave function can be obtained from 
(5.4) by setting 

= 1 and = 0 (if n -A th). 

As we have presented it, transition from the time- independent 
equation (5.1) to the time -dependent equation (5.3) is essentially 
equivalent to a simple replacement of the energy E by the expres- 
sion ih ~ . In quantum mechanics, this expression is known as the 
energy operator^ 

E = (5.7) 

The effect of this operator on any function amounts to an ordinary 
differentiation of the function with respect to time. Thus, E is a 
linear differential operator. In the case of a monochromatic wave, 
for which 

we have 


Et(0 = E„4.(0. 


Operators will be denoted by roman characters. 
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Hence, it is seen that the energy is an eigenvalue of the energy 
operator E. 

There are other operators besides the energy operator in 
quantum mechanics. The most important of these is the momentum 
operator 


P = 


i 


V 


n j 

t dr * 


(5.8) 


which takes its name from the fact that in the case of a free particle, 
its eigenvalue is identical to classical momentum. Indeed, operating 
with p on the wave function of a free particle [see Eq. (4.62)], we 
have 


p t (0 = x ^ =P'^ (')• 

Thus, in this particular case, the eigenvalue of the operator p is 
the classical momentum p , 

With this operator notation, which helps to bring out more 
clearly the relationship between the quantum-mechanical and clas- 
sical laws of motion, the Schrodinger equation (5.3) has the form 

= ( 5 - 9 ) 

Thus, to carry out the formal transition from the classical theory 
to quantum mechanics, it is necessary to replace the energy E and 
momentum p in the classical equation for the law of conservation 
of energy^ 


by the corresponding operators and to operate with them on the 

wave function. The operator is known as the kinetic -energy 

operator T, andT + K as the Hamiltonian operator U. For the sake 
of brevity, we shall call the latter the Hamiltonian. 


the electron is in both an electric field and a magnetic field, which is characterized 
.by the vector potential A, then, using the classical expression (1.15) for the Hamiltonian, 
the time-independent Schrodinger equation can be written in the form 



(5.9a) 
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Using this operator notation, Eq. (5.9) can be rewritten in the 
form 

(E — H)'H0 = 0. (5.10) 

For the time- independent potential V, the following relation is 
obtained: 

E^;>„(/) = /r,^.„(/) (5.11) 

and, therefore, the time-independent Schrbdinger equation reduces 
to the form 


= (5.12) 

It is apparent from this that, m stationary problems, the eigen- 
values of the Hamiltonian are equal to the eigenvalues of the energy, 
just as, in the classical case, a Hamiltonian function which does not 
depend explicitly on time is equal to the energy of the system. 


B. CHARGE DENSITY AND CURRENT DENSITY. 
QUANTUM ENSEMBLES 

In classical electrodynamics, an important role is played by 
the equation of continuity 


Jp-+v.y=0, (5.13) 

which involves the charge density p and the current density j. This 
equation basically represents a general form of the law of con- 
servation of charge. To show this, we multiply (5.13) by d'^x and 
integrate over all space 

rs ( 5 . 14 ) 

Reversing the order of differentiation and integration in the first 
integral (this is allowed since time here is only a parameter), and 
changing the second from a volume to a surface integral, we obtain 

i jpd\K-{-ji„dS=0. (5.15) 

If there are no charges or currents at infinity, the surface integral 
vanishes, and we obtain the law of conservation of charge 


e= \ prflv== const. 


(5.16) 
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We shall now use the wave theory to find an expression for the 
charge and current densities. For this purpose, we take the 
Schrodinger equation (5.3), writing it in the form 


0^ (t) 

dr 


jn_ 

‘2;Wo 




iV 

n 




Similarly, for the complex conjugate equation, we have 


(5.17) 


^*(0 

ot 


ii\ 
'2m 0 


SjVj* 


(0 


iV 

n 




(5.18) 


Multiplying Eq. (5.17) by '^■^(/), and Eq. (5.18) by '[>(/), and adding 
the two equations, we obtain 


(OMi 

■ dt 


2 m„ v{'l>(0V’i'*(0 - r (Ov'MoHo. 


(5.19) 


Comparing (5.19) with the equation of continuity (5.13), and consider- 
ing that the charge density is equal to the charge r of one particle 
multiplied by the number of particles per unit volume (that is, in 
this case the probability density), we have 


P=^'K(0'M0. (5.20) 

From (5.19) and (5.20), we find the current density 

•/= "S { 'f'* ^ 

It should be noted that for a monochromatic wave 

both the charge density 

(5.22) 

and the current density 

(5-23) 

are independent of time. 

In the case of real wave functions (']>* = ']>), the current density 
is always identically equal to zero. For instance, the current 
density of an electron in an infinitely deep, one-dimensional potential 
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well (see Fig. 4.3) is zero (/==0). This is quite natural, since 
oscillations described by real wave functions are actually standing 
waves, and standing waves cannot give rise to a particle flux. 

The case of motion of a free particle is somewhat different. 
According to (4.60), the wave function for this case describes a 
traveling wave: 


^ = L-Vie^prih^ (5.24) 

Substituting this expression for ^ into (5.22) and (5.23), we obtain 
the following expressions for the charge density and the current 
density, respectively: 




Hence, when the charge is distributed with uniform probability 
over the entire volume, its density is equal to the charge divided 
by the volume, as was to be expected. Moreover, the relationship 
between the current density and the charge density remains the 
same as in classical electrodynamics. 

Let us now explain the physical meaning of the coefficients 
which appear in the solution (Oof the time-dependent wave equation 
[see (5.4)]. For this purpose, we substitute the expression for '^(t) 
into Eq. (5.16), which serves to define the conservation of total 
charge. Then, using (5.20), we obtain 


n' n 


Since the wave functions must satisfy the orthonormality condition 


we have 




(5.25) 


We can now give a physical interpretation to the coefficients as 
quantities characterizing the probability that a particle exists in 
the quantum state n. Indeed, if we know with certainty that the 
particle is in the state = all of the coefficients C„, except 
C,,^, will have to be equal to zero; that is, we must set = If 
the particle has a nonzero probability of occurring in two or more 
states, then, accordingly, two or more coefficients will differ from 
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zero. The actual probability that a particular state is occupied is 
given by while the probability density of the distribution of 

the state over entire space is given by I !*. 

When there is a large number N of particles in a region of 
space, we have, instead of (5.25), 

n 

Here the coefficients C„ characterize the distribution of the total 
number of particles among the different quantum states. 

In this connection, we shall introduce the concept of quantum 
ensembles: namely, a collection of identical quantum states that 
are described by the same wave function A quantum ensemble 
can be used to describe an electron beam, photon flux, and so on. ^ 
Two different cases are possible. ^ 

1) All of the particles are in the same quantum state N 

and iC„|" = 0 for n / /z,,. 

2) The particles have a definite distribution among the various 
quantum states: 


|C,„r==/V„3 (5.26) 

where 



In the second case, the motions of particles, say, in the states// — /z, 
and n — fiit cannot be considered independently of one another, 
because whenever two states and'^/^. are both possible, the total 
wave function is a superposition of the individual states, namely. 

This is important in determining the total probability of a state, 
which is proportional to the product (O 'MO- la addition to terms 
such as this product will contain mixed terms of the 

form la other words, each particle will possess a 

definite probability of being in both quantum states. The mixed 
terms differ from zero for coherent 9 waves (a pure ensemble), 
and this leads to interference of the de Broglie waves, which does 
not happen with incoherent waves (a mixed ensemble). Thus, in a 
pure ensemble, waves are added, whereas in a mixed ensemble, 
intensities are added. Quantum ensembles are useful with regard 


^See D. I Blokhintsev, Principles of Quantum Mechanics (trans ), Allyn and Bacon, 
1964 
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to the statistical interpretation of the results of wave theory. For 
a large number of particles or a large number of states occupied 
by single particles, it follows from the law of large numbers that 
the probability of some particular process, when calculated on the 
quantum-mechanical basis, should agree with the distribution that 
can be observed in experiment (see Chapter 7). Similarly, in 
classical statistical physics, theprobability distribution agrees with 
experiment independently of the nature of the “hidden’’ parameters. 


C^CONNECTION BETWEEN THE SCHRODINGER THEORY 
THE CLASSICAL HAMILTON-JACOBI EQUATION 

The Hamllt on-Jacobi equation, which is used in classical me- 
cha^hics to' describe the motion of a material particle in a field of 
fore®, is a firsJL-oxder nonlinear differential equation. We shall 
briefly •recapitulate its derivation, sfarting from the classical law 
of conservation of energy 

Let us introduce the action function 

t t 

S(0=| Xdt=^ (2T — E)dt = S-~Et, (5.28) 

where 

t 

S=^2Tdt. (5.29) 

0 

As we shall show, the dependence of S and t in (5.29) is only im- 
plicit, namely, 5 = 5(v(/), z(0), and therefore we have written 

it without including / as an argument. Thus, we can distinguish 
between the stationary action function 5 and the time-dependent 
action function (/) in the same way as we distinguish between the 
stationary wave function and the time- dependent wave function. 

To show that 5 is not an explicit function of / in (5.29), we take 
the total differential of S: 


From the equations 


dS^2Tdt. 


dS^^ 


dS j. , dS , 


dS , , dS , 

^ydy^ ^-dz. 


2Tdt = mo (.i-^ -\- tf -f z^) dt — p^^dx -f p^dy -j- p^dz, 


(5.30) 
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' ^ iR H S ^ C/^ tt>4c tiuh ^>tv3 U) 


dS 

dt 


= 0 


and thus 5 does not depend explicitly on f. Moreover, 

p==VS. (5.31) 

From Eqs. (5.31) and (5.27), we obtain the stationary Hamilton- 
Jacobi equation 

2i-(V5r+V'-C = 0. (5.32) 


We can introduce the time-dependent action function 

S{t) = S~Ef 

into this equation by making use of the relation p = y S = \/ S(i), 
and eliminate the parameter E with the help of the relation 

E = — ^ obtain the time-dependent Hamilton- Jacobi 

equation, which can be used in nonstationary problems: 

2,V[v5(O]‘ + V'+^]^ = 0. (5.33) 

The stationary and time-dependent Hamilton- Jacobi equations cor- 
respond to the stationary and time-dependent Schrddinger equations, 
respectively. 

It can be readily shown that in the case of motion of a free 
particle (F = 0, p = consl, /:=rconst), the action function is 

S(0 = -£/+pr. (5.34) 

To see this, it is sufficient to substitute the expression (5.34) for 
S(t) into (5.33). Let us take special note of the fact that the function 
»]>(/) for the motion of a free particle is [see (4.60) ] 

» /M A - 4-S(t) 

(^(t) = Ae^ =Ae^ . (5.35) 

The relationship between the wave function and the action func- 
tion will hold in general form whenever we make the transition 
from the Schrodinger equation to the Hamilton- Jacobi equation. We 
shall consider further only the stationary case, for which 


* 5 

^ — Ae*^ ■ 


(5.36) 
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We start with the Schrodinger equation in operator form 

Making use of the momentum operator p = — we obtain from 
(5.36) 


p'. = (VS)^, (5.38) 

We note from Eq. (5.38) that Eq. (5.36) leads to the same relation 
between the momentum operator p and the action function S as in 
the classical theory if we replace the momentum operator by the 
classical momentum p. Substituting (5.39) into (5.37), we obtain 

^ (5.40) 

which is simply the transformed Schrodinger equation. 

To obtain the Hamilton-Jacobi equation, we must neglect the 
last term in (5.40), that is, set ^==0. It is a well known fact that 
the quantum-mechanical equations transform exactly into the 
classical equations when H—O. If, on the other hand, h ■/: 0, but the 
condition 

(V5)V'-;j|V^5| 

is satisfied, the quantum-mechanical terms provide only small 
corrections to the classical equations. The approximation corre- 
sponding to this case is knownas the^z^^isZ-cZassmaZ approximation. 
Since /; = V5, the above condition can be rewritten as 

In particular, for the one-dimensional case, we have 


dp _ 

- — 

d\ 

dx ~ 

dx 

2ndx 


(5.41) 


Thus, the quasi-classical approximation turns out to be sufficiently 
accurate in cases where the de Broglie wavelength is constant or 
changes very little over distances of the order of the wavelength. 
Since 


p = ]/2mo (£ — I/), 
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we can rewrite the condition (5.41) in the form 


y 


dp _ 

ntoFh 

dx 



(5.42) 


where f classical force acting on the particle. From 

this, it follows that the quasi-classical approximation becomes 
inapplicable at small values of the momentum and, in particular, at 
the points where a particle would come to a stop in the classical 
theory (£ = F, p = 0). Such a state of affairs is obtained, for 
example, in a potential well at the points where the direction of a 
particle is reversed as a result of reflection from the potential 
barrier (the turning points). A simple explanation can be given for 
this conclusion: namely, the de Broglie wavelength tends to infinity 
as p — 0 and, when this happens, the wave-like properties of a 
particle become too important for the particle to be treated quasi- 
classically. 


D. THE WENTZEL-KRAMERS-BRILLOUIN APPROXIMATION 
— — 

As mentioned above, Eq. (5.40) is completely equivalent to the 
Schrodinger equation. Therefore, it would be possible to take 
Eq. (5.40) as the basis of the wave theory by treating the term 
which is proportional to fl and which does not appear in the classical 
equation as a new quantum-mechanical potential energy 

l/quan (5.43) 

which has to be added to the Hamilton- Jacobi equation. The general 
solution of the nonlinear Schrodinger equation (5.40) is, however, 
much more complicated than the solution of the linear Schrodinger 
equation, and therefore the many attempts to develop the quantum 
theory by means of further analysis of Eq. (5.40) met with failure. 
Fortunately, Wentzel, Kramers and Bnllouin succeeded in finding 
an approximate solution of the Schrodinger equation (5.40) by taking 
only the terms of the order of H. This solution was found to be 
applicable to a number of problems in quantum mechanics. It is 
referred to as the WKB approximation. 

For the sake of simplicity, we shall consider only the one- 
dimensional case, assuming that the potential energy is a relatively 
smooth function of a' (see Fig, 5.1). For particles with energy £, 
the range of variation of x can be divided into two regions. In the 
first region (a<Ao), the energy E is greater than the potential 
energy (£>F), and in the second region (a >Ao), £<F. It is obvious 



84 


NONRELATIVISTIC QUANTUM MECHANICS 


that B=V(Xn) at the boundary {x — Xq) of the two regions. For the 
one-dimensional case, the original equation (5.40) becomes 


— iflS'^ = 2t?h{E — V) = p\ (5.44) 



First, we shall find the solution of this equation for region I 
(F>F), where the quantity />^>0 can be interpreted as the square 
of the classical momentum. The solution will besought m the form 
of a series 


S — “1~ *^1 *^2 ■f' • • • » (5.45) 

where the quantity is independent of //, S, is proportional to /Sr. 5^ 
is proportional to etc. Substituting the series (5.45) into (5.44) 
and neglecting quantities proportional to ti" and to the higher powers 
of //, we obtain’ 

H itlS^^ ~ f)\ (5.46) 

Taking both sides of the equation, we equate terms that are inde- 
pendent of h and, similarly, terms that are directly proportional 
to h (here it is necessary to bear in mind that the quantity is 
proportional to h). We thus obtain 

= p\ 2S,S, = ihS'o. (5.47) 

Hence, it follows that 

Sg — ± ^ = (5.48) 


4 

This approximate method of solution is basically the same as the perturbation method, 
which IS also successfully used in the solution of the Schrodinger equation (see Chap- 
ter 14 below). 
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Therefore, retaining only terms of the order of we have 

Ao 

S = S.j -!- Si = ± ^ pdx-\-ih In Yp. (5.49) 

A 

Substituting (5.49) into (5.36), we obtain the following expression 
for the wave function in the first region (a'<^.Vo): 

'o Ao 

= COS i /W.vH sin j j pf/vV (5.50) 

In exactly the same way, for the second region (x^Xo), in which 
P'C^O, we obtain 


, -is 

= -hCe ■'« ), ( 5 _ 51 ) 

where 

\p\===V^2m;(v~~'n): 

The wave functions (5.50) and (5.51) are the desired approximate 
solutions. From these equations it is seen that, when the 

wave function is cosinusoidal or sinusoidal, as in the case of a 
potential well [see Eq. (4.21)] or a free particle [see Eq. (4.47)], 
whereas, whenF^E, it changes exponentially, as in the case of 
transmission through a potential barrier [see Eq. (4.23)]. 

Comparing the solutions for F= const with the solutions for the 
case where the potential energy is a function of v, we see that the 
transition from one case to the other is simply equivalent to re- 
placing the area of the rectangular barrier contained between the x 

axis and an axis indicating the constant quantity x = == 

n H 

by the area enclosed between the x axis and the curve for V -^V (x). 
Schematically this transition can be represented in the following 
manner: 


-n^~’i^\p\dx. (5.52) 

0 

A similar transition is made in the case of a potential well. 

Thus the specific form of the dependence of the potential energy 
on x does not alter the character of the solution. Indeed the solution 
is determined only by the sign of the difference between E and V, 
as we indicated at the end of the preceding chapter. 

The solutions (5.50) and (5.51) give a good approximation only 
for regions that are relatively far from the special point Xq (the 
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classical turning point) where the quantity |p| is relatively large. 
Near the special point the quantity p'->0; therefore, the 

denominator in Eqs. (5.50) and (5.51) vanishes and the actual 
solution diverges. If we could express the coefficients C and Z^in 
terms of A and B, the foregoing approximation would be entirely 
adequate for many problems, since the region lx — Xol — Ois com- 
paratively narrow. However, the relationbetween these coefficients 
can be found only if we can connect the functions across the boundary 
(x = Xq) of the regions (by connection of the solutions we mean 
matching the wave functions and also their first derivatives at the 
boundary x->Xo). The approximate expression for must, there- 
fore, be represented in such a form that at large Eq. (5.50) holds, 
while for x=Xq, when 

= — (x — Xo) 2mo V'' (Xo) = — 

the approximate solution satisfies the equation 

f' — a(x-Xo)']> = 0. (5.53) 

At large 2 , the cosine function can be expressed asymptotically 
in terms of an nth order Bessel function: 

cos [z - - 1 ) ^ J„ (z). (5. 54) 

Therefore, if we set 

z=j^^pdx, (5.55) 

X 

then 

= (5.56) 

For large z (and any n), because of the asymptotic formula (5.54), 
the solution (5.56) transforms into the solution (5.50) found by the 
WKB method. Let us attempt to choose the order n of the Bessel 
function in such a way that the solution (5.56) satisfies the Schro- 
dinger equation not only for large z, when x .^Xo, but also near the 
turning point x— >Xo, that is, when 

p^^ — *0 and g = ^ ^ - (Xq — xf^^ 0. (5.57) 

In this case (x-^Xq— - 0), the asymptotic expression for becomes 


Jn Y/ (Xo - xr>). 


(5.58) 
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Substituting (5.58) into (5.53), we find that Jn must satisfy the 
equation 


d‘Jn 


d.v" 

jVo — .V dx ‘ \ 


x)- 


1 


4 (.Vo —x) 


;)y„=o. 


Introducing the new variable 


-LV 


z = 


\ — (the argument of the 


Bessel function) into this equation, and denoting by primes the 
derivatives with respect to this argument, 


= (5.59) 


If the Bessel function is to satisfy Eq. (5.59) and if, at the same 
time, the wave function is to obey the asymptotic equation (5.53), 

we must put /j = ±;4-. Thus, instead of the approximate solution 

o 

(5.50), we have 


I*- 




Similarly, in place of (5.51), we obtain for the second region (/:<^V) 


- 1/ j \P |rfv{c/v, (^- J I I dx) + 

Ao .Vo 

X 


(5.61) 


where / 1/3 and /_i/, are the Bessel functions of an imaginary argu- 
ment. 

In order to connect the two solutions, we must find the asymptotic 
forms of Eqs. (5.60) and (5.61) for the region The appro- 

priate values of 2 andp are determined from (5.57). For the Bessel 
function, it is enough to take only the first term of its expansion: 

•'.W='.<*l=f,/+T)(7r- (5.62) 


The solutions (5.60) and (5.61) become, respectively, 

/I /IT (Sac)'/* B y¥ 


'i'.(=;ro-0 = 


r'' r(I/3) 

C /"iT 
ft‘'=r(i/3) 


(JC, — a:) + 


^i/s(3a)'/,r(2/3)’ 

D 

ft‘^!!(3a)‘/* r ci/a)’ 


’l’;(=.jco+0 


(5.63) 
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Connecting the solutions at the point x — we find 

D = B, C = — A. 

Considering the asymptotic forms of the ordinary Bessel function 
[see (5.54)] and of the Bessel function of imaginary argument^ 

(?) ^ -y - (e' -f e" cos s (n -f i)) - 


^Thc Bessel function of imaginary argument is related to the Bessel function of the 


first kind by the equation 


where 




The asymptotu' form of the Bessel function with x ^ calculated either for 

(/i t 0 


'^,(0 


• for </) 0 

2 


,i7/(n + ‘/,)^ cos(^.x 1 - 7r(n 1 1 / 2 )^ for - —n. (5 64a) 

ln^ \ 2 / 2 2 




- cos^A ^ n in \ 


1 n , n 

,' 2 ) ) lor 0 - 

2 2 


(S 64b) 


(see, for example, F. Morse, and H Fcshbach, Metfunh of Tlicomtical New York 

McGraw-Hill, 1953, Vol I, p 622) 'Fherefore, the asymptotic form of f„ for real ? has a 
discontinuity (Stokes phenomenon) Indeed, using (5 64a) and (5 64b), we find for the 
two cases 





\/2ttc 2 


(5 64c) 


(5 64d) 


Unfortunately, many authors, including Morse and Feshbach(see \1pthods of Theoretical 
P/ivsics, Vol II, p 1097), use (5 64c) in analyzing the passage of particles through a po- 
tential barrier and thus obtain a complex asymptotic expression for the real function /„(^). 

The correct procedure at 0 (where the function has a discontinuity) is to take the half- 

2 

sum of (5 64c) and (5 64d) in order to obtain the asymptotic expression (5 6 4), which, as 
expected, is real 
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and also taking account of the relation between the coefficients, we 
find that for large z, the formulas (5.60) and (5.61) take the form 

f (1 J pdx - §) + 

+ 5cos (5.65) 




2 V\P \ 


-1 ^ 1 p 1 rf.V 

{{B~A)e -i- 


~-n S ' ^ 1 

+ (i5-f /l)cos -^0 J. 

Setting B = A = -^ in the last equality, we find the first pair of 
connected solutions 


= (5.66) 

--ij lp|rf.f 

for which the exponentially decreasing solution (5.67) in the region 
a:>Xo is the analytic extension of the sinusoidal solution (5.66) for 
the region a: <^. v„. 

To determine the analytic extensionof the exponentially increas- 
ing solution (x>Xo), we must set B==- A —b. We then obtain the 
second pair of connected solutions 

^ ^ + "4 ) ’ 

X 

1 \ I'’!''* 

According to Eqs. (5.66) and (5.68), the expression for <)( in 
region I (x<[xo) is of the form of standing waves. It can also 
be written in the form of traveling waves. Indeed, setting 

* = (fi'+A). a—i(g — h). 


(5.68) 

(5.69) 
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we have 


Yx<xo 


Yp 


=■ [ge 




-{■he 


-a 


5 prf^+ 7 ) . 

X \ 


(5.70) 


The appearance of the factor in the denominator of (5.70) means 
that the probability of finding a particle in a unit volume (that is, a 
quantity proportional to |'>p) is smaller, the greater the velocity. 
Inside the barrier the exponental solution that is connected to (5.70) 
takes the form 


Xq 


4 


1 

V\p\ 


X 


i li 

{g+li)e •>« 


" 2 



(5.71) 


In cases where the potential barrier is located to the left of the 
turning point, we must interchange the limits of integration, so that 
the lower limit will always be less than the upper limit. 

On the basis of these results, we are able to quantize the problem of the potential well 
(that is, find the energy levels of the particles) in the WKB approximation. Let us assume 
that we have a potential well of arbitrary shape, as illustrated in Fig. 5.2. Obviously, 



Fig. 5.2. Quantization in a potential well according 
to the WKB method 


the process of quantization in the WKB method will consist in finding the conditions under 
which the exponentially increasing solutions on both sides of the potential barrier (a'< a'i 
and .V > A's) vanish. 

According to (5.66), the wave function for this case has the following form in that 
region of the potential well which is adjacent to the boundary of the barrier at a* = 
(there being only an exponentially decreasing solution inside the barrier); 








(5.72) 
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Because of the requirement of an exponentially decreasing solution inside the potential 
barrier in the region v - Xi, the following solution is obtained for the potential well in the 
region x:>Xi: 

X 

'I 

Two solutions must be identical at any arbitrary point x in the potential well (vj <;a*<Ca*s^, 
as long as we do not take points too close to the boundary of the potential barrier. Joining 
the solutions (5.72) and (5.73) at some point r(that is. matching the two wave functions 
and also their derivatives), we have 


Xi X 

I ^ t) - “ (i + f ) 

X Xi 

a' cos^-i + o + j) = 0. 

X Xi 

If this system of homogeneous equations is to have a nontrivial solution for a and the 
determinant of the system must vanish. We then obtain the relation 

X% 

sin(i. J /,rf.c+|) = 0. 

Xi 

Aa 

Tlie integral^ p <fA* cannot be negative, since p== Y'2tihjE~‘^V)^Q. Hence 

•^1 A'a 

1 J pd,v- + I = (« + !)«, « = 0, I, 2 (5.74) 


Thus the quantization rules obtained from the WKB approximation (that is, with an 
accuracy up to the terms of the order of H) have the form 


(j) p (lx = 2n1i -f j . 


(5.75) 


These quantization rules differ from the Bohr quantization rules by the presence of a 
nonzero term 2nti, which corresponds to the lowest state (n = 0). A more exact solu- 
tion of a similar problem in wave theory (for example, the harmonic oscillator problem) 
shows that a zero-point energy necessarily exists, although it does not affect the radiation 
spectrum. 

Let us now find the normalization coefficient of the wave function for the case of a 
potential well. In normalizing the function, we can restrict ourselves to integration over 
the interval Xi ^x^ (potential well), since decreases exponentially everywhere 
outside this region. The normalization coefficient can be then found from the equation 


A2 X 

sin»[l +|| = 1. (5.76) 

Xi Ai 


Since the sinusoidal function oscillates rapidly, its square can be replaced by its average 
value, 1/2, without significantly affecting the accuracy. We then have 



Xi 
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“ J " iP 

Jf, Xi 


2r, 

The oscillation period t = - (<*> is the angular frequency) is 

0 ) 

2k .. ' 

•c = 

(0 

Xi 

where t; = is the velocity of the particle. Hence^ for the normalization coefficient, we 

tfio 

obtain the expression 

1 /~2<^rtto 

^ r ‘ 

Consequently, the eigenfunction (5.73) in the WKB approximation assumes the form 






(5.78) 


Prob l em 5- L| Determine the eigenfunctions and eigenvalues of the energy of a particle 
iti'irffireS-afflfeiis tonal potential well bounded by an infinitely high potential barrier: 

fO forO<A*</i, 0<y</3, 0 < 2 < /g, 

' ’ ’ ( oo outside the potential well. 


Find the conditions under which different wave functions correspond to the same energy 
value /:, that is, the conditions under which the energy levels are degenerate. 

Answer, 






I j_ 'j1\ 

a 2nio \li "T /j /«;» 


nin»ni 

8 

lii'iU 

(n,, no, 2, 3,...). 


KtliX , KtUiV TMiZ 

sin-~^ sin —7^ sin — p- 
li h U 


In particular, when Aj = Zi, we have degeneracy for the case of the two different wave 
functions 


F — F 

Problem 5.2, iFind two classes of solutions (symmetric and antisymmetric) for a 
"!!cISTtrtl WILiiylmensional symmetric potential well: 


l^(.v)=- 


Vo for ^ < — / (region 1) 

0 for — l<:x c I (region II) 
Vo for x:>l (region III) 


(5.79) 


Show that, when Vq can have any arbitrary value, only the symmetric solution must 
always have at least one energy level. Find the condition under which an antisymmetric 
solution IS possible. Show that the antisymmetric solutions corresponding to .v>0 also 
represent a complete solution of the system in tlie case where V (x) is described by 
the function 


I CO for A* < 0, 

0 for 0 < A < /, 
Vo for X > I, 


(5.80) 


llioso problems are based on the material contained in both Chapters 4 and 5 
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Plot a graph of the potential energy and of the symmetric and antisymmetric wave 
functions. 

Solution. The wave functions in regions I, II and III have the form 


4/,= 

tpjj =1 A2 sin kx + Bi cos kx (— / < jc < /), 

+,„=C.e- (.r>/), 

where 

* = ]/-■»“- • * = • 

The exponentially Increasing solutions have been discarded because they diverge (indeed, 
this is the factor that is ultimately responsible for the discrete values of the energy). 


From the boundary conditions at a: = - 4-/, it follows that j,, (a') = «l/ii(jc) and 

1. niW _ ^ From this, we find that it is possible to have either a symmetric 

dx dx 

solution, for which = — a*), or an antisymmetric solution, for which <Ka) = 

= ~4/(— a). 

For a symmetric solution As =0, Cs = Di , and the energy levels are found from 
the equation _____ 

tana=|/"£^, (5.81) 

where 

Since — a*/a > 0, the minimum value of the angle a must lie in the first quandrant 
and can be found from the condition 

I a 

cos g = =7r. 

/ l4tan«o P 

This equation will always have one root for any value of p in the region 0 < o C as is 

easily shown graphically from the fact that = const > 0. 

For the antisymmetric solution (a) = — ']^ ( — a)], we have^a = 0 and C* =— Z!li, 
Tlie energy levels are determined from the equation 


tan a = -- 



(5.82) 


Since the right-hand side of (5,82) has a negative sign, the minimum value of the angle a 
must now lie in the second quadrant and can be found from the condition 


sin g = 


tan g 

Y 1 4* tan* a 


n 



Introducing the notation 


’=2 


where the angle 7 lies in the first quandrant, we find 


cosTf=2^ + i. 
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It is evident that this equation has at least one root only when 

o It , r rr 

? > ^ , Ue. . for Vo > g — f . 

Since the wave function of tlie antisymmetric solution vanishes at ^' = 0, the antisymmetric 
solution for -r >0 is also the solution for a particle In a potential field described by 
(5.80). 

VPr€>blem 5,3^ Find the transmission coefficient D of a particle through a potential 
bamUl UrP'dLi angular shape (see Fig. 5.3), if the particle energy /: is less than the height 
of the potential barrier Kq. 



5.3 Transmission of a particle through a potential 
barrier 


Solution, dhe solution of the Schrodinger equation for the various regions has the form: 
H for .V < 0 (region I) 

1 for 0<A'<« (region II) 

i|;,„ + for A*>fl (region III) 

Here "^^ 2 ^ (^o ~ and characterize the incident and 

reflected waves, respectively; characterizes the transmitted wave; and 

characterizes the reflected wave coming from infinity. Since we have no 
reflected wave from infinity in our case, we must set U^ = 0, 

'lo determine the transmission coefficient we shall use the boundary conditions at 
A ~ a and .V 0. We first express An and Ba in terms of /la, making use of the fact that 
ma ; I : 


/l3 = — 2— 
Ba— — 


and then express Ai in terms of ^Is 




(i-«(i+i) 

4“ 


e^^Ai. 


The transmission (diffusion) coefficient D is then found to be 




(1 + «T 


’ = e.\p^- 


2%a + In 


I6n* \ 
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where 



X 



Neglecting the second term in the exponential for D(this is possible because the quanuty 
1 6 / 1 ^ 

only slightly different from unity), we finally obtain 


2mi,{Vo-E) 

D^e 

When ft = 0. get the classical result D = 0, 

I Problem 5.4,1 A particle is in a potential field of the form 


cr», A* < 0, 

0, OCX Cl (region I) 
Vo, icxch (region II) 
0, hex (region III) 


(5.83) 


a) Show that the spectrum is continuous. 

b) Show that for x/ > 1 and y (h — l) = Ya > 1 (0</:c'Ko) there must exist quasi- 
levels, that is, states such that there is only a transmitted wave in region 111 (see 
Problem 5.3X 

c) Construct a graph of the wave function corresponding to a quasi-level, 

d) Find the squared modulus of the wave function inside the potential well and 
explain why it decays exponentially with time. 

Solution, In the various regions, the wave function has the following form (see the 
notation of Problem 5.3): 

sin /tA, 

From the boundary conditions, we find 



These relations hold for any value of E, and therefore tlie spectrum is continuous. 

We obtain quasi-levels by requiring that there should be only a transmitted wave in 
the external region (region III), that is to say, only a wave moving in the positive a direc- 
tion. Setting /)j = 0, we obtain the following equation for the energy of the quasi-levels: 


tan a 






a'J 

X 


X 


fiH — a* — la \ Infill — a‘/ 


(5.84) 


It should be noted that the amplitude Ci In this case Is much smaller than the amplitude 
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For fl = OQ^ Eq, (5.84) becomes exactly the same as Eq. (5.82) which gives the energy 
levels of the potential well (5,80X 

We let denote the value of the energy of a particle in the potential well which we 
obtain from (5.82) and, for present purposes, we disregard all re^ corrections to Eq, as 
they make no essential difference to the problem. We then obtain the following expression 
for the energy: 

— -im, (5,85) 


where 


and where 


O ^ (Vo - Eq) tnoV^ 

^21 » 


If 


16 / 1 * 



under the condition x/ 1, 

Ihe presence of an Imaginary part in the expression (5,85) for the energy indicates 
that the wave function decays exponentially with time. The transmission coefficient 
through the potential barrier for this case can also be found from (5,85), Indeed, the 
squared modulus of the wave function Inside the well is 

I ^ l* = const-e'^^, 


and thus X, which Is called the decay constant, characterizes the decrease of the prob- 
ability of finding the particle inside the potential well. The quantity A is related to 0, the 
transmission coefficient for a single collision of the particle with the potential barrier, 
by the equation 


21 




where is the number of collisions with the barrier per unit time. Hence we obtain an 
expression for D: 

Ihe same expression for D was obtained in another way in Problem 5,3, In Chapter 6 we 
shall obtain it for the general case in which the potential energy is an arbitrary function 
of the coordinates. 

"Hie constant A also determines the duration of radiation | (see also Problem 

3,4), Therefore, the quantity AZ: 'v. h\ should characterize the width of an energy level. 



Chapter 6 


Basic Principles of the Quantum Theory 
of Conductivity 


A. TRANSMISSION OF A PARTICLE THROUGH A 
POTENTIAL BARRIER (TUNNEL EFFECT) 

According to the classical theory a particle can be located only 
at those points in space where the potential energy V is less than 
its total energy £. This follows because the kinetic energy of a 
particle 

4-=E-V ( 6 . 1 ) 

must always remain positive. In a region where k > C (a potential 
barrier), the momentum has an imaginary value and, classically, 
the particle cannot exist in such a region. Therefore, if two regions 
of space in which E^V are separated from one another by a 
potential barrier inside which k>i:, the classical theory does 
not allow a particle to penetrate from one region into the other. 
In^the,^ wave theory, l^wever, an imaginary value of the r^^omen- 
tmn^ (see the" WK6 lip^roximafion'^ metRodT* ChS^eT^?^ smply 
corresponds "fb an exponential (Sep^IidelTce^f ffiejyave functmn^^ 
tKe^6?Trdl£afes/"Since the v/ave TuflfctiM does not vanisTi inside the 
potential barTier, it is quite possible for a particle to leak through 
it. This phenomenon is observed in the case of microparticles. 
The penetration through a potential barrier is called the tunnel 
effect. It is a specifically quantum-mechanical effect and has no 
analog in classical mechanics. 

With the WKB method, it is a relatively simple matter to deter- 
mine the probability of penetration of a particle through a potential 
barrier of an arbitrary but sufficiently smooth shape (see Fig. 6.1). 

Let us assume that a particle is moving in the direction of 
positive X in the region I where £>F(x). It en- 
counters a potential barrier where E<^V(x), at the 

point and then falls into the region III where again 

E'^V (a). The beginning and end points of the potential barrier can 
be found from the condition 


F {x) = E. 


( 6 . 2 ) 
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The de Broglie waves corresponding to the motion of this particle 
will be partially reflected from the potential barrier and partially 
transmitted through it. The transmitted waves will then propagate 
in the region III To determine the probability of penetra- 

tion of a particle through the potential barrier, let us begin by 



r'lft () I Schematic dia^nini of a potential barrier 
of an aibvtrary but sufficiently smooth shape 
The in( ulent and transmitted waves are repre- 
stMited by the solid curve, and the reflected wave 
by the dashed curve 


analyzing the wave in region III, where the solution has the simplest 
form, since in this region there can be no wave moving in the 
direction of negative x. According to Eq. (5.70), the solution in 
region 111 has the form 


ijiin — 


1 

Vp 







V 



(6.3) 


where 


p^\'2in,(E-V(x))\ 


(6.4) 


We define the transmission coefficient as the absolute value of the 
ratio of the flux density of the particles transmitted through the 
barrier to the flux density of the incident particles: 

^ = 1-^1- (6.4a) 

To determine the particle flux, we shall make use of Eq. (5.23). 
Setting the constants e and nio equal to unity (which we are allowed 
to do because we are interested only in the ratio of the particle 
fluxes), we obtain 
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/ 



dx 


)■ 


Substituting (6.3) into (6.5), we find 

for /i,„ = 0, 

/ill “ I ^*111 1* for ^111 = 0 


(6.5) 


( 6 . 6 ) 

(6.7) 


It is seen from the above that the amplitude characterizes the 
wave propagating in the direction of positive and the amplitude 
li[n the wave propagating in the direction of negative x. Since, as 
has been already mentioned, there is no wave propagating in the 
negative x direction in region III, we must set //iii = 0. Then 


/tr=lgii,r. (6.8) 

and the corresponding joined solution inside the potential barrier 
(region II) can be found from Eq. (5.71). Setting and in 

Eq. (5.71), we find 


tS 'rl'''' , - 'n S I''!"' 

_ V t * - * V 


where 




]. 


(6.9) 


(6.10) 


Using the equation 


IJ |^ldv--T- j J (6.11) 


where 

jr. .Vj 

r = ^j| r2m„(l/_£yr/.v, (6.12) 

Xi 

the solution (6.9) can be put into a form such that it can be connected 
to the solution in region I by means of Eq. (5.70). Thus we have 


Vii = 


r^p 


i[- 


giii«'r 


- \ 
h 3 


p I 




h J 


\pl (fv 


]• 


(6.13) 
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According to Eq. (5.70), the solution in region I has the 

form 




1 

Vp 



(t5 


t ) 


-\-hie 



(6.14) 


Substituting this solution into (6.5), the following ejq)ressions for 
the incident and reflected waves are obtained: 


/inc = |/'.P; (6.15) 

/ref=— Ifir.P. (6.16) 


Equations (5.70) and (5.71) can be used to relate the coefficients 


From this, we find 


£r,and/i, with g,„e''and 


^1 + ^*1=4' ^111®'^' 

(6.17) 


(6.18) 


(6.19) 


(6.20) 


In accordance with the definition of the transmission coefficient 
(6.4a), we have 




I Ar ( 

1 Anc I 


1^1 


1 



( 6 . 21 ) 


In exactly the same way, we find the reflection coefficient 


R 


j/ref 1 _ 


l^inc ^ 



( 6 . 22 ) 


From these formulas, it follows that the sum of the transmission 
and reflection coefficients is equal to unity: 

/? + D=l. 


(6.23) 
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For cases where the quantity 7 is much larger than unity (these 
are the only ones of practical interest), the transmission co- 
efficient (6.21) is given by the expression^ 

Xy 

] V2mo{V-E) dx 

Dg^e-^^ = e . (6.24) 

In the classical limit (/i — 0), it is evident that the transmission 
coefficient becomes zero, as we would expect, and thus the 
penetration of particles through the potential barrier is impossible. 


B. THE TUNNEL EFFECT AS A MANIFESTATION OF 
WAVE PROPERTIES 


The penetration of particles through a potential barrier is a 
typical manifestation of their wave properties. Therefore, an 
analog of this effect must occur in 
every type of wave theory. 

In optics, this analog is the well- 
known phenomenon of total internal 
reflection, which occurs when light is 
reflected from an optically less dense 
medium. Let us assume that a ray of 
light propagating in glass strikes a 
glass-air interface (air being the op- 
tically less dense medium). (See Fig. 

6.2.) Then the wave field, which can 
be characterized by the electric or 
magnetic field strength, is described 
in glass (where the index of refraction 
1) and in air (/i~ 1), by the follow- 
ing equations, respectively: 



Fig. 6.2. Propagation of light 
from a more dense to a less 
dense medium. 


no.n 

— /nij/ — (V sin Oj -j- V cos Oj) 


<pi=Aie 


- 10)11^ -f- 4 ' Sin Oji - 1 - j' cos Oil) 


(6.25) 

(6.25a) 


In solving the similar problem of penetration of a particle through a rectangular poten- 
tial barrier (see Problem 5 3), we obtained the same exponent as in (6 24), but in front of 


the exponential there was a factor 


16n^ 

(1 + ''^)^ 




which was of the order of 


unity In the case of a smooth barrier (that is, a barrier exerting a force which is continu- 
ous at all points), this factor becomes exactly unity 
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Equating these functions ^ at the interface (at the plane ^ = 0), 
we obtain the familiar laws of refraction: 

(oii“OJi; sin 6ii = n sin Oi; cos On = K 1 — sin-* Oj. 

Substituting these values into (6.25a) we find for the refracted 
wave 

— j (v/i sin 0| }- V V^l — sm- 0 ) 


From this it is seen that, if nsin0|<^l, an ordinary wave will be 
propagated in region IL In the case where nsinOi^ 1, we have total 
internal reflection, the physical analysis of which can not be given 
on the basis of the laws of geometrical optics. From the standpoint 
of physical optics, which accounts for the wave properties of light, 
the electric and magnetic fields are exponentially decaying: 

(Oj _ „ /UJ[ 

Y sin^ 0i — 1 V — -I' — - xn sm 0i 

<Pn = e ^ I I f 

If in the case of total internal reflection the refracted wave en- 
counters a second glass surface (region III) (that is, we have two 
pieces of glass separated by an air layer), then in region III the wave 

is again propagated according to Eq. 
(6.25). Its amplitude, however, will 
be an exponentially decreasing func- 
tion of the width of the air layer I . 

— sin2 Oi — W t£* 

Aiii = Aie " . (6.26) 


A diagram of the tunnel effect in 
optics is shown in Fig. 6.3. 

Before concluding the physical 
analysis of the penetration of par- 
ticles through a potential barrier, 
we should also consider the so- 
called ‘ ‘ tunnel effect paradox. ’ ’ This 
paradox lies in the fact that, at first 
glance, it seems as though the real 
classical particles inside the poten- 
tial barrier are in a peculiar state characteristized by an imaginary 
momentum. However, it is important to remember that in this 
purely quantum-mechanical phenomenon, it is only the probability 
of a particle being somewhere inside the barrier which decreases 
exponentially as we recede from the boundary into the potential 
barrier. Inside the barrier, the momentum and position of a particle 
are real, and are both given within the framework of the ordinary 


2 

We shall not write here the expression for the reflected wave. 


Incident 



Fip;. 6.3. Schematic diagram of the 
tunnel effect in optics (total inter- 
nal reflection). 
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uncertainty relation. In order to show this, let us consider a 
potential barrier with constant V. In the first approximation, the 
wave function inside the barrier changes according to the equation 

?u=Ae^, (6.27) 

where 

y = -~l/'2mo(V — F) = const. (6.28) 

The right-hand side of (6.27) can be represented in the form of a 
Fourier integral 


where 




f{k)= 


2 % 

r. + k-) • 


(6.29a) 


which means that it can be represented as a set of wave functions 
having real momenta. Obviously, the amplitudes / (/e) will effectively 
differ from zero only when k varies in the range from 0 to x. 

Thus the uncertainty in momentum is 

According to wave mechanics, the position of a particle inside 
the potential barrier can be determined only to an accuracy within 
the order of the width of the barrier: 

l = X2 — Xi, 

Multiplying Ap and Aa- we obtain 

ApAA:^-^ h'd. 

Since our equations are valid only for the case x/ > 1 , the accuracy 
in determining the momentum of the particle audits position inside 
the potential barrier will not contradict the uncertainty relation. 


C. MOTION OF ELECTRONS IN A METAL ^ 

The theory of the tunnel effect has a number of very important 
applications both in the theory of metals and in nuclear physics. 
On the basis of this theory it is now possible to explain a number 
of phenomena which could not be accounted for in classical physics, 


^See H. Bethe and A. Sommerfeld, Elektrontheorie der Metalle, Uandbuch der Physik, 
Berlin: Springer, 1933, Vol. 24, part 2. 
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such as cold emission (the emission of electrons from a metal 
under the action of an electric field), contact potentials, etc. Before 
discussing these phenomena, we shall say a few words about the 
theory of an electron gas, which underlies the electron theory of 
the conductivity of metals. 

The high conductivity of metals indicates that electrons are able 
to move relatively freely inside the entire crystal lattice of a metal. 
Their excape from the metal into vacuum is, however, hindered 
because this requires the expenditure of a certain energy, the so- 
called “work function.” This suggests that as a first approx- 
imation, we may simply consider the metal as a potential well, 
inside which the potential energy of an electron can be taken as 
equal to zero (1/ ~0), while outside the metal (that is, in a vacuum) 
V = V^o>0. 

This simplified model enables us to explain several phenomena 
occurring in metals. Some of its fundamental results, obtained for 
the case of free electrons, can be extended (with the help of quantum 
mechanics) to include the periodic field of a crystal (see below the 
simplest one-dimensional Kronig- Penney model, which correctly 
describes at least the qualitative aspects of many phenomena). 

The electron gas model of a metal was first considered in 
classical theory (the theories of Drude, Lor entz and others). In this 
version of the model, the classical Maxwell-Boltzmann statistics, 
which had successfully explained many phenomena in the kinetic 
theory of gases, was now applied to electrons. However, the elec- 
tron gas model encountered great difficulties in developing a 
theory of specific heat. In accordance with the theorem of equipar- 
tition of energy, well-known from classical statistical mechanics, 
each degree of freedom must have, on the average, an energy^ 

i ^av =-" 4 kT'. (6.30) 

where k is Boltzmann’s constant. From this, it is evident that the 
contribution of each tree electron to the total specific heat will be 
the same as that of a free atom: 


This contradicts the experimental facts which indicate that the 
specific heat of a monatomic metal is the same as that of the lattice 
atoms; that is, in the first approximation, free electrons make no 
contribution to the specific heat of a metal. 


4 

The specific heats of monatomic substances will be considered in greater detail in 
Chapter 12 [see formula (12.66)]. 
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This contradiction was resolved by Sommer feld, who showed 
that electrons in a metal do not obey the classical distribution 

_ ^ 

f---=:Ae 

Instead, the distribution is characterized by the Fermi-Dirac 
distribution function 


The Fermi-Dirac quantum statistics is based on the Pauli 
exclusion principle, according to which each energy level can be 
occupied by at most two electrons (two quantum states which differ 
only by the direction of spin). 

If we are given a three-dimensional potential well of a cubic 
shape, with side length equal to L, then, according to Eq. (4.57), 
the components of momentum p = fik will be related to the integers 
/q, rii and characterizing the energy level by the expressions 


Px^ 


'iTihth 
■ L " 


'InTirii 'Irtitu 

Py = —I—^ Pz= i 


We note that a unit interval of quantum numbers 


(A/Zj A/Z^ -- A/Z.( ~ 1 ) 

A/z, Vz 2 A/z., = ,^„^',y^. cPp . (6.30a) 

is associated with only one level, occupied by two electrons. There- 
fore, if there are Po electrons per unit volume, the maximum 
momentum of an electron at absolute zero(T = 0)can be determined 
from the relation 


Po= 



A/ZjA/z^A/z,, 


^max 

0 


pmax 


or 


Pmax — ft (3 tc^P«)^/® . 


The corresponding maximum kinetic energy is 

Inio IniQ ^ ‘ 


p.) pmax ti“ 


(6.31) 


(6.32) 


(6.33) 


This energy is called the Fermi energy. 
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As an example, let us compute the value of this energy for 
silver. The density of silver is 10.5, and its atomis weight is 107.9. 

Assuming that the number of free 
electrons is equal to the number of 
atoms per unit volume, we have 


Vacuum 


! Vacuum 


P„ = j^g.6.02.10- 


= 5.8 • 10^2 


where we have used the Avogadro 
number (the number of atoms in one 
gram-atom, equal to 6.02 • 10^*). 
Hence, Eq. (6.33) gives 


= • 10 erg=5,3 ev. 

Since the work function for silver 
is W — 3J ev, the depth of the 
potential well in silver is found to be equal to 9 ev. A schematic 
diagram of thefillingof energy levels in a metal is given in Fig. 6.4. 

The average energy of an electron in a metal is given by the 
equation 


Fig. 6 4. Model of a potential well for 
a metal. 

Kniax Upper limit of the filled 

levels at T — 0 (the Fermi energy) 


^max 

pit 2 ( pn 

Po 3 VP — 5 ^ max • 

0 


(6.33a) 


In agreement with experiment, it follows that at relatively low 
temperatures the electron gas makes no contribution to the specific 
heat, since 


c 


el 

V 


dEa y 

dT~~ 


0. 


If the temperature differs from zero, some of the electrons will 
jump into higher energy levels. The distribution of electrons in 
the higher levels will not be characterized by a Maxwellian dis- 
tribution, but by the Fermi distribution function 


f = 


I 


A 


1 


(6.34) 


At T — 0 (the case considered above) , this function equals unity if 
zero if /; > For T 0 the average energy can 

be obtained from the expression 

£av = |F5nax + iJ^ + 0(7'‘)- 

max 
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Hence, the contribution of each electron to the specific heat is 


tv 


f)E^ 

df 


2E\ 


” <h. 


(6.35) 


which vanishes as T-> 0. At high temperatures, when the quantity 
becomes much greater than unity, the Fermi-Dirac distri- 
bution function (6.34) approaches the classical Maxwell-Boltzmann 
distribution 


f = Ae-~E/kT^ (6.36) 

which, as we know, implies the following expression for the average 
energy of a free electron: 


£®i = |/er. (6.37) 

By comparing this expression with the average energy at low 
temperatures [see Eq. (6.33a)] and using the condition 

(6.38) 

it is possible to define the degeneracy temperature 

At temperatures higher than (T> we can use classical 
statistics to describe the behavior of electrons in the metal. If 
we substitute the value of Po, say, for silver, we obtain F^eg ^ - 10 — 20 
thousand degrees. Thus, at all temperatures at which a metal 
exists in the solid state, the electron gas has a certain degree of 
degeneracy. This means that in discussing the properties of elec- 
trons, we must use only Fermi-Dirac statistics, and, moreover, 
the principal term in the expression for the kinetic energy of free 
electrons is independent of temperature. 

This large value of the degeneracy temperature is associated 
with the small mass of the electron Ions and molecules have 
a mass thousands of times greater than the mass of an electron, 
and therefore classical statistics applies to them at ordinary 
temperatures. 


D. REMOVAL OF ELECTRONS FROM A METAL. 

COLD EMISSION 

From the potential-well model of electrons in a metal (see 
Fig. 6.4) we can see that to remove an electron from a metal 
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it is necessary to impart to it an amount of energy no smaller than 
the work function 

W = V,-E^^. (6.40) 

As we know, in the external photoelectric effect an electron 
receives an energy from the absorbed photon. Thus, an electron 
can leave the metal with a kinetic energy 

= (6.41) 

(Einstein’s equation). It follows that the work function represents 
the minimum amount of energy that must be added to the electron 
in order to make its energy greater than the height of the potential 
barrier. 



Fip; 6.5. Potential energy of an electron in a metal 
in the absence and in the presence of an external 
electric field. 

The dashed line shows the behavior of the potential 
energy curve when the image force is taken into 
account 


In a metal at 7^0 some of the electrons occupy energy 
levels lying above the Fermi level. If we increase the kinetic 
energy of the electron gas by heating the metal, a certain fraction 
of electrons may acquire an energy exceeding the height of the 
potential barrier, and thus a current will flow from the metal. 
This phenomenon, which is known as thermionic emission, is used 
to obtain an electron beam in electron tubes. Under the action of 
an external electrostatic field, this current may also arise even at 
lower temperatures. Let us consider the influence of an external 
electric field applied to the surface of a conductor in the negative 
X direction. The potential energy for this case is 

V(x) = V,-e,l.x, (6.42) 

where — is the electron charge and g is the electric field 
intensity (see Fig. 6.5). 
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In addition to the external electric field, the electron experiences 
an electric force called the image force. This force arises because 
an electron with charge induces an “image” charge eo at the 
surface of the metal (see Fig. 6.6). 

Metal 




Fig. 6 6. Image forces: an electron outside 
the metal experiences the attractive force 
of the induced charge. 

Thus the total force acting on the electron is 

P = (6.43) 

The effective potential energy, taking into account the image force, 
is of the form 

l^eff =1^0 — — (6.44) 
The quantity V eff has a maximum at the point x^, : 




aveff 

dx 


IiAjl 

y 


(6.45) 


The maximum value of Feff is less than Vq because 

^max ^0 — y (6.46) 

Thus, taking into account the electric image force shows that when 
an external field is applied, the work function decreases and be- 
comes equal to 

U7'=r-~y^. (6.47) 

The electric image forces, however, do not explain cold emission. 
In fact, an estimate of the maximum current (with W' = 0) for 
tungsten, for instance, gives the value 

g = ^ ^2 • 10* v/cm, (6.48) 


whereas experimentally rather strong current is obtained with a 
field as low as • 10® v/cm (Millikan). 
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Thus, within the framework of the classical theory it is 
impossible to explain quantitatively the cold emission of elec- 
trons. 

In the quantum theory of this phenomenon (essentially the trans- 
mission of electrons throughapotential barrier), we limit ourselves 
to £q. (6.42) for the potential energy and neglect the electric image 
force since it does not significantly affect the final result. It can be 
seen from the graph of potential energy (see Fig. 6.5) that the 
external electric field produces a potential barrier of finite width. 
Because of the tunnel effect an electron can penetrate this barrier, 
the transmission coefficient being given by 

Ai 

-^V2mA VV(x)-E ax 

D=e « (6.49) 


The integral in the exponent must be taken over the entire width of 
the barrier from jc = 0 to the point given by the condition 


Va ” == ths^t is , 



(6.50) 


Then 


jfi 

\ YV{x) — Edx=y^ YV^ — eS^ — Edx=^ 
0 0 

xx 


(6.51) 


Finally, we obtain the following expression for the transmission 
coefficient D: 


U—e 


_8o 

€otl^ — g S ^ 


(6.52) 


where the quantity 8o depends on the nature of the metal and the 
energy of the free electrons inside the metal. The cold emission 
current is proportional to the transmission coefficient 

So 

y==/,D=yoe (6.53) 

It follows from the last equation that cold emission should be 
observed for an electron field of gr>^10® v/cm. This result is in 
good agreement with experimental data. 
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E. CONTACT POTENTIALS 

Contact potentials, which were discovered by Volta, can also be 
explained on the basis of the tunnel effect. Let us consider two 
different metals with different work functions and different Fermi 
energies (see Fig. 6.7). If these two metals are brought into contact, 
they will still be separated by a potential barrier of finite width. 

Vacuum Metal I Vacuum Metal II Vacuum 



Fig. 6.7. Two metals before they are placed in contact with each 

other. 

Wj and M^2 work functions, and Eq 2 are the upper limits 

of the filled levels (the Fermi energies) 


Since a certain number of filled energy levels in metal I lie above 
the highest filled level of metal II, electrons can move from metal I 
into the empty levels of metal II by the tunnel effect. From Fig. 6.7, 
it is seen that no flow in the opposite direction is possible, since 
electrons of metal II would then have energies corresponding to 
filled levels of metal L It is obvious that the electric current from 
metal I to n ceases only when the uppermost filled levels of both 
metals are of the same energy. 



Fig. 6.8. Two metals after they have been brought into 
contact. Formation of the contact potential. 


As a result of the tunnel effect, metal II acquires an excess of 
electrons and is charged negatively, whereas metal I is charged 
positively. Thus, the energy levels of metal II are shifted upwards 
relative to those of metal I (see Fig. 6.8). After the Fermi levels 
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in both metals are equalized, the electric current ceases, but then 
there arises a potential difference proportional to the difference 
between the work functions of the metals: 


= — (6,54) 

eo 

This quantity is called the contact potential. 


F. THE MOTION OF ELECTRONS IN A 
PERIODIC ELECTRIC FIELD (THE ONE-DIMENSIONAL 
KRONIG-PENNEY MODEL) 

As has been previously mentioned, the representation of the 
motion of an electron in a metal in terms of the potential well 
model is an approximation in which we average out the periodic 
potential of the lattice. A number of characteristic features of the 
motion of electrons in a crystal appear only when the periodic 
variation of the potential is taken into account. In the general case, 
the solution of the problem is very complicated. In order to deter- 
mine some of the qualitative features of this motion, we may con- 
sider, however, a simplified model of a crystal. 

In the one-dimensional Kronig-Penney model, the periodic 
electric field produced by the positive ions of the crystal is approx- 
imated by a periodic square-well potential of the form shown in 
Fig. 6.9. The width of each well is denoted by a, and the width of 
the barrier between two successive wells by b. Thus, the period of 
the potential (the equivalent of the lattice constant) is c = a-\-b. 
The barrier height is set equal to Kq. 



P'lg 6.9. One- dimensional Kronig-Penney model of a 
crystal. 

The solution of Schrodinger^s equation for the nth section of 
the periodic potential has the following form: 
for the potential well; 

= An sin kXn + Bn COS kXn, (6.55) 

for the potential barrier: 

^k = An sinh — c) + Bi cosh 


(6.56) 
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Here^=|/^^^, Xn = x — cn; the coordinate 

is measured from the origin ofthe nth section (that is, the nth well). 
Similarly, for the (n+ 1)®^ section, we can write 

= Sin kXn^i~\~Bn+i cos kXn,i. - (6.57) 

We first join solutions (6.55) and (6.56) at the point = rn + a (that 
is, atjc^ = a), obtaining 


An^inka-]- B^coska = — yl^sinh -|- ^^cosh x^, (6.57a) 

An^oska — BnSiT]ka = ^ (AnCosh y.b — B,isinh x6). 

Next, joining the solutions (6.56) and (6.57) at the pointA:==c(n + 1) 
and noting that x„ = c and we obtain 


Bh = 4- 1 , 

A'n = -^An^l. 


(6.58) 


Let us substitute (6.58) into (6.57a) and simplify the problems by 
considering the limiting Kronig- Penney case in which the width of 
the barrier between two wells tends to zero (b->0), the height Va 
tends to infinity, and the width of the well remains constant; 


"m^ba 

“2 


= p== const. 


Then, since cosh xft-> 1 and sinh x^->x^, we have 

sill to -f cos ka^B„,i, 

A„ cos to — B„ sin to = /1„4, — 

Equations (6.59) are linear difference equations; their solutions 
should be sought in the form 

A, = C,r^ Bn = C/\ (6.60) 

Substituting (6.60) into (6.59) and dividing both equations by we 
obtain an equation from which we can determine the quantity r and 
the relationship between the coefficients C, and C>; 


Cl sin /v’a = C 2 {r — cos ka)y 
C .2 ^ r — sin ka] = Ci (r — cos ka). 


(6.61) 
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Multiplying these equations together and dividing both sides of the 
resulting equation by CiQ, we obtain an equation for r: 

— 2r cos /j'a + 1 = 0, (6. 62) 

where cos ka is given by the equation 

* p 

cos k'a = sin ka -f- cos ka. (6. 63) 

As we shall see below, (6.63) is the fundamental equation for the 
energy levels in the periodic field of a crystal. 

The solution of (6.62) has the form 

r = cos ka zb i sin ka = e~ . (6. 64) 

We note that if the right-hand side of (6.63) is greater than unity, 
/e'will be imaginary, and in this case we get an exponential solution 

r = (6.65) 

Let us examine in greater detail the solution for the case of real 
values of k 


r = 


From Eqs. (6.61) and (6.60), we have 


Cj — Cq 


- cos ka 


Jk'a _ 


sin ka 


cos ka) 


sin ka 


C, = C„ = 


Substituting the values of y4„and B„into (6.65) and using the fact that 
x„=x — an when b-*-0, we find an expression for the wave function 
in the crystal: 


— ( 6 . 66 ) 

where U„ is a function with the same periodicity as the crystal 
lattice 

si" HXn-a)l (6.67) 

In particular, if P = 0 (that is, there is no barrier), we find from 
(6.63) that/j' = /e; in this case the function becomes unity. 

It follows from (6.66) that an electron can move freely in the 
crystal if k is a real quantity, that is, if the right-hand side of 
(6.63) is less than unity. 
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G. BASIC PRINCIPLES OF THE ELECTRON THEORY 
OF CONDUCTIVITY OF CRYSTALS 


The quantum theory of electron motion in a crystal lattice provides a key to distin- 
guishing between conductors, dielectrics or insulators, and semiconductors (which in a 
sense form an intermediate class of solids). We do not intend to treat this subject 
quantitatively and shall confine ourselves to a few qualitative remarks based on the one- 
dimensional Kronig-Penney model. 

As the starting point of our analysis, we shall use Eq, (6,63) to determine the possible 
values of the electron energy In a crystal lattice. This equation is 

p 

cos k'a = sin ka + cos ka, (6,63) 


where a is the lattice constant, and the quantity k = gives the electron energy. 

The eigenvalues of the electron energy can be found from the condition that k' must be 
real. This condition means that the right-hand side of Eq. (6,63) must be less than unity, 
^ttlngP = oo, we obtain the energy spectrum of isolated atoms (in this case the atoms 
are separated from one another by an impenetrable barrier). The energy spectrum will 
then consist of a separate set of levels for each well: 

P _ ( 6 . 68 ) 


where n= 1, 2, 3 .... We shall not consider the negative values of since they give 
exactly the same values of energy, but correspond to the motion of electrons in the 
negative A" direction. The first two levels (n = I, 2) 


^1 = 


2moa- * 


(6,69) 


of isolated atoms are shown in Fig. 6,10. 


-£2 


£2 


Energy levels of 

isolated atoms . . . ^ 

Energy levels of the ^ 
crystal lattice 


Fig. 6. 10. The formation of energy bands in a crystal lattice. 

If P is finite [see Eq. (6.63)], it is easiest to determine the energy levels graphically 
(see Fig, 6,11), The allowed values of k (and therefore of the energy) correspond to the 
values of the right-hand side of Eq. (6,63) lying between — 1 and 1, In Fig, 6,11, these 
allowed values are denoted by a heavy line. Thus, in a crystal lattice containing N atoms 
each energy level of an isolated atom is split into N levels. Each of these groups of 
levels is called a band (see Fig, 6, 10), 
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The electrons in the crystal tend to occupy the lowest energy levels and^ according to 
the Pauli exclusion principle, each level is occupied by at most two electrons with 
opposite spins. 

For example, the crystal lattice of an alkali metal contains N valence electrons if 
there are atoms. The electrons occupy only half of the lower band (since there can be at 

most two electrons in each level). In 
the ground state, half of the electrons 
move in one direction, and the other 
half in the opposite direction; con- 
sequently, the average current is equal 
to zero. When an electric field is 
applied, more than half of the electrons 
move in some preferred direction, 
thus producing an electric current and 
moving up Into higher energy levels. 
Therefore, in a metal, either the first 
band must contain a sufficient number 
of unfilled levels, or it must come into 
contact with a second empty band, called 
conduction htmd^ 

On the contrary, in a perfect insula- 
tor all levels in the first or valence 
band are occupied and the levels of the second (conduction) band are all empty. The energy 
spacing between these bands is usually several electron volts. It is easy to show that, 
when a field is applied, the electrons cannot acquire a preferred direction of motion. The 
direction of motion of an electron can be reversed only if the electron goes into a different 
energy state. But since all the energy levels are occupied, this can only happen if the 
electron occupying that other energy state makes the opposite transition to the state 
originally occupied by the first electron. Therefore, on the average, there can be no 
preferred direction of motion of the electrons even when a field is applied. A diagram 
of the energy levels of a conductor and a dielectric is given in Fig. 6. 12, 



Fig. 6.11. Determination of possible energy 
levels in a crystal lattice. 

The heavy lines on the abscissa axis indi- 
cate the allowed values of ka (allowed 
bands). 



"O- 


Metal Perfect dielectric 

Fig. 6.12. Energy level diagram of a metal and a dielectric 


Nevertheless, every dielectric possesses some (very small) conductivity. The con- 
ductivity is much greater in semiconductors, where the forbidden band is considerably 
smaller than in dielectrics; it is of the order of 1 ev, and sometimes even less (for 
example, in germanium, the width of the allowed band is 0.66 ev at 7 ==300 ‘K)l At absolute 
zero, semiconductors behave like dielectrics; however, their conductivity increases with 
rising temperature, particularly in the presence of impurities. At room temperature, 
the resistivity of semiconductors is found to be of the order of 10'^ — 10^ ohm.cm. At the 
same time, the resistivity of dielectrics lies within the range of 10^— I0**ohm»cm, and 
that of metals in the range of 10'®— lO'* ohm*cm. 

The conductivity of dielectrics and semiconductors can be of two types. Let us assume 
that under the influence of thermal excitation or the internal photoelectric effect (absorb- 
tion of light by electrons), some of the electrons Jump from the valence into the conduction 
band, leaving vacant states ('^holes'') in the valence band. The electrons that have jumped 
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Into the conduction band become carriers of electric charge, thus producing a current. 
On the other hand, as soon as several states become empty in the valence band, an elec- 
tric current can also be produced due to the motion of electrons in the valence band 
itself. It can be shown that the motion of a system of electrons in an almost completely 
filled valence band can be treated in terms of the motion of a set of vacant states or 
'Tioles.^* Obviously, the holes move in a direction opposite to the direction of electron 
motion, so that they behave like positively charged particles. Thus, there are two ways 
in which charge can be transported and, therefore, two types of electric current in solids: 
n type ( due to the motion of electrons) and p type (due to the motion of holes). 

With the help of the band theory, we can easily give a qualitative explanauon of a 
number of interesting phenomena. 

For example, the conductivity of a metal increases with decreasing temperature 
because its resistance is due to the interaction between free electrons and the vibra- 
tions of the lattice. Since the vibrauons of the lattice diminish as the temperature 
decreases, the resistance will also decrease. On the contrary, the conductivity of semi- 
conductors decreases as the temperature is lowered, because the number of electrons 
in the conduction band becomes increasingly smaller. The existence of p type conductivity 
in semiconductors has been demonstrated in investigations of the Hall effect and thermo- 
electric emf. The sign of the potentials appearing in these phenomena is determined by 
the sign of the current carriers. Experiment shows that in some semiconductors the 
sign of the potentials corresponds to electron carriers, whereas in other semiconductors 
the sign of the potentials is reversed; that is, the current carriers act as if they were 
positively charged particles, A natural explanation for this is provided by the concept 
of "holes," 


The above conclusions in regard to the conductivity due to transitions of electrons 
from the valence band into the conduction band (the long arrow on the left-hand side of 
Fig. 6.13) and due to the motion of holes in the valence band are based on the assumption 
of a perfect crystal. This type of conductivity is known as intrinsic conductivity. 

Mother type of conductivity, known as impurity conductivity, also plays a significant 
role in semiconductors. It is caused by the presence of foreign impurities or other 
structural defects in the lattice. Such disruptions of the perfect periodicity of the lattice 
lead to local deviations of the field from a perfectly 


periodic one. As a result, discrete levels may 
appear in the forbidden band of the energy spec- 
trum of the electrons (see Fig, 6. 13). In practice, 
the wave functions associated with these levels 
differ from zero only in a certain region near the 
given defect. Thus, discrete levels are sometimes 
called impurity levels. They do not themselves 
contribute to the current (the electrons occupying 
them are not free). However, they may affect the 
number of electrons contained in bodi the con- 
duction and the valence bands. 

Impurity levels (denotedby short arrows on the 
right-hand side of Fig, 6. 13) can be divided into two 



Conduction 

band 

Donor level 


Accepter 

level 

Valence 

band 


types: donor levels and acceptor levels. A donor 

is capable of supplying electrons to the conduction Kig. 6.13. Diagram of energy bands 


band, so that free electrons will appear in the m a semiconductor. 


conduction band. In contrast to a donor, an 


acceptor absorbs electrons; as a result a "hole" is formed in the filled band so that 
electrons in this band are able to Jump into higher energy states (ptype conductivity). 

If the impurity levels are located sufficiently close to the edge of the corresponding 
bands, ionization (or other effects) can arise fairly easily due to the energy of thermal 
motion of the lattice. Therefore, an appreciable number of free electrons (or holes) can 
exist in bands even under conditions when direct excitation of electrons from the valence 


band into the conduction band is highly Improbable. 

In the case of impurity conduction, the magnitude and type of the conductivity are 
determined mainly by the nature and concentration of the impurities. By varying the 
impurities, it is possible to control within a wide range the magnitude of the conductivity 
as well as its type ( n type or p type),^ This fact is widely utilized in semiconductor 


Pure germanium has a high resistance but the addition of a small amount of impurities 
may give it either n or p type conductivity 
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electronics. It constitutes the basis of operation of modern high-quality crystal diodes 
and transistor devices which are capable of rectifying AC currents, as well as of ampli- 
fying and generating electrical oscillations. 

In conclusion, we would like to draw attention to two important approximations which 
we have implicitly assumed in the electron theory of solids. First, in developing the 
theory of metals and semiconductors, we were actually concerned only with the problem 
of single electrons. Accordingly, this theory is called the single-electron theory. The 
collective properties of electrons in solids were taken into account only in connection with 
the filling of energy levels by free electrons, when we calculated certain statistical 
quantities (the Fermi energy, specific heat, etc,). Even in those cases we completely 
neglected interactions between electrons, although they do in fact experience mutual 
Coulomb repulsions. It is easy to show that, for the electron densities which we observed 
in metals, the average energy of this repulsion Is not necessarily small in comparison 
with the Fermi energy. There naturally arises the question of the extent to which this 
crude single-electron model represents the properties of a real metal. The answer to 
this question is given in the many-electron theory of solids, wlilch we cannot consider 
here since this recently developed theory is a highly specialized subject. We may note 
that the fundamental notion of the electron gas as a free carrier obeying Fermi-Dlrac 
statistics IS retained in the multielectron theory. Accordingly, the laws of purely 
statistical character obtained in the single-electron theory still hold true, such as, for 
example, the linear dependence of the electronic specific heat on temperature. At most, 
certain numerical coefficients will have to be slightly changed because of the specific 
form of the wave functions for a system of electrons. These refinements, however. 
Involve exceptionally large computational difficulties. 

On the other hand, in dynamical problems, where it is essential to take into account 
the interactions between electrons, the simplied approach used above (the single-electron 
theory) is Inadequate. One case of this type is the problem of the strength of a metal, 

7he second approximauon which is implicit in the single-electron theory concerns the 
crystal lattice, which is regarded simply as the source of a certain statistical field. The 
situation is actually more complicated because the lattice ions execute a vibrational 
motion, which, as we know, persists even at absolute zero (zero-point vibrations). In 
order to take this fact into account, it is necessary to consider the crystal lattice as a 
quantum-mechanical system rather than simply as the source of a field (see Chapter 12, D), 
The essential point in this connection is that the lattice is not Isolated from the system 
of electrons, but coupled to it since the electrons interact with the lattice ions. The energy 
of this interaction can be uniquely represented by the sum of two terms. One of them 
represents the potential energy of Interaction between the electrons and the stationary 
lattice. The motion of electrons in a field of this type was considered above in the simple 
case of the Kronig- Penney model. The second term is connected with the deviations of 
the ions from their equilibrium positions and represents the interaction energy between 
the electrons and the lattice vibrations. The average value of this interaction energy is 
extremely small in comparison with the energy of the electrons at the Fermi level, 
because the amplitudes of the vibrations of the lattice ions are small (except at tempera- 
tures near the melting point). 

The vibrations of the lattice, however, may play an important role at low temperatures, 
when the Interaction of each electron with the vibrations of the lattice gives rise to an 
additional interaction in a pair of electrons. If these two electrons have opposite spins, 
the Interaction results in an attraction (in contrast to electrostatic repulsion). As a 
result, the pair of electrons begins to move with a certain degree of correlation. Such a 
system is found to have the property of superconductivity, ^ The smallness of the inter- 
action which causes the correlation explains why superconductivity is observed only at 
extremely low temperatures, 7 


^A theory of superconductivity based on these concepts was developed independently 
by the American physicists Bardeen, Cooper, and Schrieffer, and by the Russian physicist 
Bogolyubov [See N Bogolyubov, V Tolmachev, D. Shirkov, A Neiv Method in the Theory 
of Superconductivity (trans ), New York: Consultants Bureau, 1959 1 

^See R Peierls, Quantum Theory of Solids, Oxford: Clarendon Press, 1955; W Shock- 
ley, Electrons and Holes in Semiconductors. New York: Van Nostrand, 1950 



Chapter 7 


Statistical interpretation of Quantum 
Mechanics 


a/ elements of the theory of linear 

OPERATORS 


In our general investigation of the Schrodinger wave equation 
(Chapter 5), we saw that in quantum mechanics the momentum 
operator p = — ih^ is associated with the classical momentum p of 

the particle, the energy operator E = /ft with the energy the 

Hamiltonian operator or simply Hamiltonian H with Hamilton’s 
function //, etc. Before analyzing the physical meaning of the 
quantities represented by operators in quantum theory, let us 
consider certain general aspects of the theory of operators. 

In the same way as a function relates a number a: with another 
number y = f(x), an operator M associates one function / (a:) with 
another function 


^(x)==fAf(x), (7.1) 

according to some given rule. 

In order to satisfy the principle of superposition, only linear 
operators with the two following fundamental properties are used in 
quantum mechanics: 




(7.2) 


where C is an arbitrary constant. 

The linear operators most commonly encountered are the differ- 
entiation sign (for example, the momentum operator M = ^ and 

the LaplacianM= v’) and the integration sign. 

In Poisson's equation 


Vf{r)=p{r) 


(7.3) 
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the operator is the Laplacian which converts the function / (r) into 
another function p(/-). Conversely, we can solve Poisson’s equation 
and find 


f(r) = v-’p (r) = - J p (O Kir, r') d^x’. (7.3a) 


Here the operator the form of a definite integral, the kernel 

of the operator being 




1 ___ 

47i|r-r‘r‘“ 


I 

+ (y -yr + ’ 


The operators and which have the same effect as multiplica- 
tion by unity when they are applied in succession, are called inverse 
operators. 

The Hamiltonian operator 


H = T + 1/, 


consists of the sum of the kinetic energy operator T = — 

which is directly porportional to the Laplacian, and the potential 
energy V, which is simply a function of the coordinates. 

We can regard the action of the potential energy V (.y j on the 
w ave fun ction as the actiotTot’A llliy arUUferalor . bec ause conditions 
are satisfied THerelore, besides differential and integral 
operators, the linear operators that may be used in quantum me- 
chanics include any function of coordinates whose action on the 
wave function is simply to multiply it. For example, the coordinate 
r is just as entitled to be considered an operator (the position 
operator)^ as the momentum operator p = — /7/v» which is a differ- 
ential operator. It is worth noting in this connection that in quantum 
1 mechanics r does not represent the position of a particle, but is an 
I argument of the wave function and determines its value in the 
coordinate space. The quantity which is equivalent to the position 
can be found from the operator r and the function ^(r) in the same 
way as the momentum of a particle is found from the momentum 
operator, that is, by averaging (see the following). 


^This will become particularly clear when we write the Schrbdinger equation in momen- 
tum space (see below), where the wave function depends on p In this case, the momentum 
operator p will correspond to multiplication by an ordinary function, and the position oper- 
ator X to differentiation with respect to p. 
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B. ELEMENTS OF REPRESENTATION THEORY 

The position operators x, y and z obviously commute with each 
other, since operating with them is equivalent to multiplying by 
ordinary numbers. Accordingly, xy = yx, xz = zx , and soon. The 

operators P v — — . Pv == — ^ and p^. ~ — t/i ^ also commute 

with each other, since the result of differentiation is independent 
of the order in which it is performed 


___ ^^2 
dydx * 


dydz Ozdy * 


and so on. 


Similarly, it can readily be seen that the pairs of operators p^ and 
y, p^ and X, and so on, also commute with each other: 


Pa'/'}' = - 'A ^ </'? = — Hiy S- = y?A , 


that is. 


pA-i/==^yp... 


An example of a pair of noncommuting operators is provided 


by X and p^. Indeed, 



(7.4) 

whereas 


= - ih = -ili[\+ x^) 

(7.5) 

Consequently, 


— A-pJt = — 

(7.6) 

that is. 



(7.7) 

In a similar fashion, it is readily shown that 


y^y — ypy = p^z — zp^ = - ifi- 

(7.8) 


The noncommutativity of these operators, as expressed by Eqs. 
(7.7) and (7.8), is a significant feature of quantum mechanics. 

The specific c hoice of the form of position and momentum 

operators x and(p7=^/^^^ satisfying Eqs. (7.7) and (7.8) corre- 


sponds to the sa-uaHUU Coordinate representation, i n which the 
wave function deoends onlTTJITTBBTSTHrH!^^ r I o =V(r) L 
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The Schrodinger equation can also be written in the momentum 
representation. For the sake of simplicity, let us take the one- 
dimensional case. Using the S functions [see (4.70)], we expand 
^ (a) into a Fourier integral 

(ji {x) = ji* (x') 5 (a — a') dx' = ^ dkdx'^ (a') 

Since ^ = where p = we can rewrite this equation in the form 



(7.9) 


(7.10) 


The Fourier transform of the function (a) , namely, the function 
cp(/;) which depends on momentum, is called the wave function in the 
momentum representation. Equations (7.9) and (7.10) relate the 
wave functions in the coordinate and the momentum representa- 
tions. 

Let us find what operator in the momentum representation 
corresponds to the position coordinate x. We note that in this 
representation we need not write p as an operator (that is, in 
roman type). If we substitute 'j)(A) for Atj>(A) in Eq. (7.10), then 

X'f (p) = ^ X '!/ (a;) e dA: = M , (7. 11) 

and thus in the momentum representation the operator x has the 
form 



It is easily verified that Eq. (7.7) continues to hold true in the 
momentum representation, since 

(xp — px) 9 (p) = in [ — p = ih'f (p). 

In exactly the same way, the fundamental relation (7.7) will be 
satisfied if x and p are replaced by certain appropriate matrices. 
This form of representation is called matrix representation; it 
was introduced by Heisenberg somewhat before the discovery of 
Schrodinger’s equation. 

The remainder of our discussion of operators will be conducted 
in the coordinate representation. 
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C. AVERAGE VALUES OF OPERATORS 

In classical mechanics the motion of an individual point is 
exactly specified by a function relating its position to time. This 
dependence can be uniquely determined from the fundamental 
differential equation of motion 


mor=— V V (r). 


Once we have determined r as a function of time, we can also deter- 
mine the momentum and energy of the particle. 

The situation is somewhat different if there are many particles 
involved, as, for example, in the kinetic theory of gases. In this 
case statistical laws characteristic of a large collection of parti- 
cles must be used. It turns out that the particles of such a collection 
obey certain distribution laws, which, generally speaking, apply to 
both coordinate space and momentum space (that is, a distribution 
describing both the velocities and the energies). The function f that 
characterizes this distribution is called the distribution function. 
Thus, when we deal with a large collection of particles, we can 
only consider a probability that a particle possesses particular 
coordinate and momentum values. From the distribution func- 
tion we can fin4 that average values of the position and momen- 
tum. 


x = ^ xfd^xd^p, ^ pjd'^xd% 

and the mean square of these quantities 

ri = J x'^fd^xd^p , and so on, 

which in accordance with the law of large numbers should agree 
with the corresponding experimental values. 

We must mention one characteristic feature of these statistical 
laws. In classical physics they are a result of averaging over the 
so-called ‘‘hidden^’ parameters, which determine the motion of each 
particle in accordance with Newton’s equations. These hidden 
parameters do not appear in the final results. In principle, however, 
classical theory enables us to explain why, at any instant of time, 
the coordinates and momenta of individual particles differ from the 
average values, even though this explanation may be very com- 
plicated mathematically. 

In quantum mechanics the behavior of particles is described by 
the wave function <j>(r, /), which is a probability function even when 
the system it describes consists only of a single particle. Thus, 
quantum mechanics allows us to determine only the average values 
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of dynamic variables regardless of whether there is a large 
number of microparticles or only one. It must be emphasized that 
in quantum theory it is in principle impossible to explain the 
deviations of observed variables from the average values. ^ The 
method of calculating averages in quantum mechanics is similar 
to that used in statistical mechanics. The basic formula used for 
this purpose is 


M= J (7.12) 

where M is an arbitrary operator (as a special case, it may be a 
number), and the quantity (O'MO plays the role of the distribution 
function f provided that the wave functions ([>(/) are normalized: 

(O'HO d^x=\. 

The average values of the position and momentum, as has already 
been mentioned, are in fact computed in basically the same way: 

a: = j ([)* (t)x>h {t)d'x, 

( 7 . 13 ) 

Here x is the coordinate of the center of mass of the wave packet 
associated with the function ^ (/), and p^is the momentum of this 
center of mass. 

Since the outcome of a physical measurement is a real quantity, 
the average values must be represented by real numbers. There- 
fore, the following equation must hold: 

(AI)* = A1. (7.14) 

When this requirement is satisfied, the corresponding operators 
are said to be self-conjugate (or Hermitian). 

In particular, we shall show that the operator p, satisfies the 
condition (7.14), even though it appears to be purely imaginary. As 
a preliminary, we must first prove an important theorem for 
“transferring’’ a derivative. This theorem is as follows. Suppose 
we have an integral 


oo 

( 7 = ^ uv^'^UiXj 

— cX) 


(7.15) 


2 

Von Neumann has demonstrated that hidden parameters cannot be the basis for the 
statistical laws of quantum mechanics Von Neumann's proof, howeVer, is valid only with- 
in the limits of the actual framework of quantum mechanics itself, and if quantum me- 
chanics IS not taken as an ultimate theory, Von Neumann's theorem cannot be regarded as 
generally valid. 



STATISTICAL INTERPRETATION 


25 


where . Then, if all terms of the type 


uv 




oo 

(n-'2) I 

—CO 


(7.16) 


vanish, the result of integration of G is not altered if we transfer 
the / 2 th derivative of the function v to the function a in (7.15) and 
place the factor (—1)'' in front of the integral: 


s 




:(-lr 




(7.17) 


Indeed, if we carry out an //-fold integration by parts in (7.15) and 
assume that all terms in (7.16). vanish, we obtain the relationship 
(7.17). In the case of a discrete spectrum, the conditions (7.16) are 
always satisfied because the wave function decreases exponentially 
at infinity. In the case of free motion (that is, a continuous spec- 
trum), these expressions vanish as a consequence of the periodicity 
condition. Physically, the fulfillment of condition (7.16) means that 
no particles or currents exist at infinity. 

Returning to the proof of the self-conjugateness of the operator 
p^, we substitute 


u = (/), v = — in^]) (t) and n == 1 

into Eq. (7.17). From this it immediately follows that 

and thus the self-conjugateness condition (7.14) is satisfied for 

We note that unlike the operator pj, = — ih ^^^-,the real operator 

is not self-conjugate and its average value has no physical meaning. 

If an operator M has only one eigenvalue >. (and one eigenfunc- 
tion '^), it is readily seen that this eigenvalue is identical with the 
average value of the operator. Indeed, using the general rule (7.12) 
for determining the average value of an operator and substituting 
the equation 




(7.18) 


we obtain for M 


M 




9-(0 




(7.19) 
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On the other hand, suppose that the operator M [Eq. (7.18)] has 
several eigenvalues Xi, X.,, . , . , X„, . . . , corresponding to the functions 
4^1 (0. example, this may be the case for 

the energy operator E — for which X^ = £„). Then since the 

general solution tl>(/)can be written in the form 

n 

we find that the average value of the energy operator E = ih ^ is 

E = 2^IC„rE„. (7.21) 

n 

Here each |C„|® is the probability that the particle is in the corre- 
sponding quantum state. If all except one,C„„, are equal to zero, 
then B=^En„f and thus the average energy corresponds to the eigen- 
value Consequently, the eigenvalue En^ corresponds to the 

experimentally observed energy. In the case where several co- 
efficients Cn differ from zero: C^^., . . . , , we can obtain 

any of these values of energy in experimental measurements. If 
the experiment is repeated many times, the number of measure- 
ments which yield an energy En^ should be proportional to the 
corresponding theoretical probability | 



Chapter 8 


Average Values of Operators. Change of 
Dynamic Variables with Time 

A, DERIVATION OF THE UNCERTAINTY PRINCIPLE 

As indicated in the preceding chapter, the observable dynamic 
quantities associated with the operators must be regarded as 
average values, given by Eq* (7.12). This is true regardless of 
whether these operators commute with tfie Hamiltonian, that is, 
regardless of whether the corresponding physical quantities are 
constants of the motion. 

We shall now show that if t 3 a^ 3 L_dy. nani ic quantities correspond 
to noncommuting operators, they do not have simultaneous definite 
values in quantum mechanics. Of greatest importance in this 
respect is the calculation of the deviations from the average values 
of two canonically conjugate quantities— the position x and momentum 
Px. Our discussion will be carried out in the coordinate represen- 
tation and we shall restrict ourselves to the case in which the wave 
function is independent of time (the stationary case). Then the 
average values of the position and momentum can be found from 
the relations 


( 8 . 1 ) 

( 8 . 2 ) 

First of all, we note that even though the average error or 
average deviation from the mean, which is given by 

{^x)= J (x — x) ^d^^x = X — X ==0, (8.3) 

is equal to zero, it in no way follows that the particle cannot occupy 
positions other than Jc. The reason for this is that the deviations 
have different signs relative to the mean x, and consequently they 
cancel out on the average. Accordingly, the deviation from the 
average value of the operator should be characterized by the 
variance (mean-square deviation), which is positive for all deviations 
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from X. The variance of the position can be calculated from the 
formula 


(Ax)^ = ^ (jc — xY ^ (Px = x^ ~ 2 (x)‘^ + {xf = x^ — (a:)^ (8.4) 

We note, incidentally, that if the variance is (Ax)‘^==0, it follows 
that the probability of the electron occupying a position in space 
differs from zero only at x^x. In this case the average value of 
the position is equal to the exact value; that is, the corresponding 
probability of the particle’s position can be described by a function 
similar to the I function. 

Similarly, the variance of the momentum is given by 

^ '1'* — = (8.5) 

hi order to establish the relationship between and 
let us take a coordinate system whose origin lies at the center of 
mass of the wave packet (x = 0), and which moves with the same 
velocity as the center of mass (Pjt==0). (The use of this coordinate 
system does not involve any restriction on the generality of the 
discussion.) 

In this case 


{^xY = ^ ^*x^ d^x, ( 8 . 6 ) 

Let us consider the integral 

/ (a) = J (a.X'i* + cPx. (8.7) 

where a is some arbitrary real quantity independent of x. Equation 
(8.7) can be written in the form 


/(a) = /la*--Ba + C, 


(8.7a) 


where 


/I = J 

= — I X ^ 1, 

C = J f S * = -Sr ) ♦• (- .0 jI)’ (.A = 4 > 0. 


( 8 . 8 ) 
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Since the integrand in (8.7) is essentially a positive quantity 

/(a)^0. (8.9) 

condition (8.9) imposes a certain definite restriction on the co- 
efficients A, B and C. Indeed, if this relation is satisfied for a = ao 
corresponding to the minimum of the function I (a), it will also be 
satisfied for any arbitrary a. The value of ao can be found from the 
condition 


/'(M==2/lao-~B = 0, that is, ^ 


and 


/"(a) = 24>0. 

Hence the minimum value of I (a) will be equal to 

(®o)== — ^4 

From this it follows that inequality (8.9) will hold for all real values 
of a, provided the following condition is satisfied: 

B^^4Aa ( 8 . 10 ) 

Substituting the values for A, B and C from (8.8) and using (8.6), 
we obtain the relationship between (Ap^y and (AxY 

( 8 . 11 ) 

This inequality represents a rigorous formulation of the uncertainty 
principle. 

By using the relation -—Arp = — [see Eq. (7.7)], we can 
rewrite (8.11) in the form 

( 8 . 12 ) 

Generalizing (8.12), we can say that whenever two operators and 
Mi do not commute with each other, they satisfy the uncertainty 
relation 


(AM,)» . (AM,r ^ j I M,M, - M,M, r. (8.13) 

where 

(AMJ’ == J <1-* (M, — AT,)’ -I- dJ'x, {i =1,2). (8.14) 
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As we have already mentioned, the uncertainty principle is a 
consequence of the wave-particle duality that underlies quantum 
mechanics and is in no way connected with the e3q}erimental limita- 
tions, Experiments may only prove the results which follow 
from the uncertainty principle. The basic meaning of the uncertainty 
principle consists in the following fact: the probability distributions 
of variables whose operators do not commute cannot simultaneously 
take the form of a S function (see Fig. 8.1). Moreover, if the prob- 
ability distribution of one variable approaches a S function, the 
probability distribution of the other variable will spread out. In 
the limit, when, for instance, the proba bility function for x (that 
is, takes the form of a B function [(Ajt:)‘^ = 0], the probability 

function for the momentum (that is^ l?(Pjc)l^) becomes such 

that it is constant for all values of (Apj^)* = oo. 



Fig. 8.1. The probability distribution function in (a) coordinate 

space and (b) momentum space: [(A.x)^ - h/2. 

If the distribution in coordinate space (a) contracts, the distribu- 
tion in momentum space (b) spreads. 


The necessary condition for simultaneous measurement of two 
dynamic quantities is the condition of commutativity of their 
corresponding operators. 


B, POISSON BRACKETS IN CLASSICAL AND QUANTUM 

THEORY 

The state of a system in classical mechanics is defined by its 
dynamic variables. The quantities appearing in the canonical, or 
Hamilton’s, equations of motion in classical mechanics depend on 
the coordinates x,, momenta p, and time ^ that is. 


f=f(p,, X,.. t). 


(8.15) 
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For example, in a one-dimensional time-independent problem, 
the Hamiltonian depends only on x and p^: 


With the help of the canonical (Hamilton’s) equations of motion 
we obtain 


i = = ^ and 

OPx 


dx dx * 


(8-17) 


or 


tllQX - 


dV(x) 


dx 


If there are n degrees of freedom (/=1, 2 ,..., n), Eqs. (8.17) 
take the form 


dH 


dH 


dpi' dXi' 


(8.17a) 


Hence the time rate of change of the quantity f [see (8.15)] is given 
by the equation 


t 

Using the canonical equations (8.17a), we obtain 

— — -^-1 \H h 

where the expression 


[ti. 



dji d/ 

^p^ dXi 


oH df\ 

dyidpi) 


(8.18) 


(8.19) 


is called the classical Poisson bracket. 

If f does not depend explicitly on/ , then , and consequently 

the variation of / is completely determined by the Poisson bracket: 

g==[//. /!. (8.20) 


If the Poisson bracket vanishes ([//, /l = 0), the quantity f does not 
depend on time, or It is conserved. 

/== const. 
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For example, if the energy does not depend e^^licitly on time^ then 
d///d/ = 0. Since obviously [//, //]“0, it follows that Hamilton’s 
function (the energy in this case) is a constant (// = const). Further- 
more, substituting the coordinate Xi, and then the momentum for 
f into (8.20), we obtain the relations (8.17a), that is, Hamilton’s 
equations of motion. 

We shall now g eneralize the classical Poisson br ackets, which 
can be used to fincrxhe time variation of any d^amic variable, to 
^ he quantum c^e. ““ - ‘ 

I’lrst of all, we recall that in quantum mechanics physical 
meaning can be attached only to the average values of operators 
(position, momentum, and so on). It is the time rate of change of 
these average values that we must determine. The average value of 
any operator f is given in quantum mechanics by Eq. (7.12), in which 
the time t occurs as a parameter. From this equation, we can find 
the total derivative of f with respect to time : 


= J r (0 <1' (0 fl' (0 dh-{- 


( 8 . 21 ) 


Substituting for and — ^ the expressions and^- 

respectively, we can reduce (8.21) to the form 


+ 4 J [(H -P* (0) (f t (0) - 1-* (0 f (H t (0)1 d% (8.22) 


where 


Using the theorem for transferring a derivative [Eq. (7.17)] and 
keeping in mind that the potential energy is an ordinary function of 
coordinates, we readily obtain 

^ J (H<|>*(0)(f<K0)d'A:= J t*(0Hfi{-(0d’x. 

Consequently, the change off with time will be given by the equation 

=f+iHrr). 


(8.23) 
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The expression 



(8.24) 


is the generalization of the Poisson bracket (8.19) to the quantum- 
mecanical case and is called the quantum Poisson bracket. 

Obviously, in the case where ^ = 0 (as a rule an operator f does 

not contain the time explicitly), Eq. (8.23) becomes 


^i={H77r=f. (8.25) 

It follows that in this case the timechange of / is completely deter- 
mined by the quantum Poisson bracket. Furthermore, if the opera- 
tor f commutes with the Hamiltonian operator H, the physical 
quantity f corresponding to this operator is conserved, as can be 
seen from (8.25). 

With the help of (8.25) it is easy to prove that the energy of a 
particle moving in a time- independent potential field V (r) is con- 
served. The expression {H, H} (HH — HH) vanishes in this 
case and therefore from (8.25) we have 

//=!= const. (8.26) 

On the other hand, = according to the time- independent 
Schrbdinger equation, and therefore, when ([> == '{> (0, we have [see 
Eq. (7.20)] 


that is, Eq. (8.26) is nothing but the law of conservation of energy 
(£ = const) for a particle moving in a time-independent field of 
force. 


C. EHRENFEST’S THEOREM 

We shall now find the quantum analog of the classical equations 
of motion (8.17). For this purpose we shall use the quantum Poisson 
brackets. Noting that x and P;^ do not contain the time e;q)licitly, 
let us use Eq. (8. 25) to determine x and substituting into it either 
i = x or f = pj^, as the case may be. In the case of f=A:, we find 

j6={H7]^= (H;c — a:H). 


(8,27) 
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where 

h=4+^(-")- 

Since x and V (x) commute, Eq. (8.27) can be reduced to the form 

Adding the quantity {PxXpjc — to the right-hand side of this 

equation, we have 

^ ^ (p^ ( PxX — xpj + — xp^) p^) . (8.30) 

Then, using Eq. (7.7), we obtain 

i = |'. (8.31) 

TtlQ 

In order to determine the time rate of change of the momentum 
we must substitute the momentum operator Pj^ for the operator f 

in (8,25)e Then, since pj^pi — pipx = 0, we find for 

P,=TH7p7} = (8.32) 

Hence, using (8.31), we obtain 

= (8.33) 

Equations (8.31)-(8.33) constitute Ehrenfest’s theorem, according 

to which the fundamental equations of classical mechanics can be 
generalized to quantum mechanics by replacing the classical 
variables by the average values of the corresponding operators. 


D. TRANSITION FROM QUANTUM TO CLASSICAL 
EQUATIONS OF MOTION 

Let us compare the classical equation of motion 

m^x = F(x) (8.34) 

with the corresponding quantum-mechanical form (8.33). As was 
previously stated, x is the quantity which corresponds to the 
classical position coordinate in quantum theory. Accordingly, we 
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could assume the quantum-mechanical equation to be identical with 
the classical equation if we had 

m,^ = F{xl (8.35) 

Instead of (8.33). This would be equivalent to replacing .v by its 
average value ^ in the classical equation relating the force and the 
position. Ehrenfest's theorem asserts, however, that the equation 
of motion for the q uantu m case contahis the average value of the 
actual force, that is, /•'(a). Therefore, in order to make a transition 
from quantum equations of motion to cl^sical equations, we must 
first establish the relationship between /' (a) andf (jf). 

Let us represent the force operator/^ ( a) in the form 

f(A) = f(j^ + AA). (8.36) 

where Aa==a— a, and expand F{x) in a Taylor series about the 
point A = A. Then we obtain 

F W = f (X) + (A:f) F'(x) + F"(x) -I- .... (8.37) 

Taking the average of this expression in accordance with Eq. (7.12) 
and considering thattAA:) = (A: — x) = 0, we obtain 

Fj^) = F{X)^ (^)!F'(;e)-|- ... . (8.38) 

The quantum-mechanical equation of motion (8.33), therefore, takes 
the form 




(8.39) 


( A 

Here the expression -- y- (^) is the quantum-mechanical correction 

to Newton’s classical equation. Clearly, the criterion which must 
be satisfied in transition from quantum equations of motion to 
classical equations is the inequality 

(A^<2|P§|. (8.40) 

It should be noted, however, that mere satisfaction of this inequal- 
ity is still not sufficient to allow us to apply all classical concepts 
to the description of the motion of a particle. Indeed, in quantum 
mechanics the average value of the kinetic energy T is defined as 
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whereas the classical analog of the quantum-mechanical kinetic 
energy should actually be taken as 

rfe)=-5^. (M2) 

Let us n ow e xpress the quantum-mechanical definition of the kinetic 
energy 7'(pj in terms of its classical analog For this pur- 

pose, we shall use the equation 

T(pJ = Tfe + Ap,) = ^^±^^)^ (8.43) 

where = Removing the parentheses in (8.43) and con- 

sidering that after averaging 

^^=(pr— pj=o, 

we have 

- T (A,) + (8.44) 

From this we obtain the condition under which we can make a 
transition from the quantum- mechanical expression for the kinetic 
energy (8.41) to the classical expression 

(^'7)^<Pi = 2mo7’(pJ. (8.45) 

Multiplying (8.45) by (8.40), we obtain the general condition for 
the validity of the classical approximation in the microscopic world; 

(i;^)'^ . W.y < 4«io7’ {pj I I ■ (8.46 ) 

If we take into account the uncertainty relation 

(^•( 77 ' ^-4-, 

condition (8.46 ) becomes 

(Px) I I > • 

Let us apply this condition to the hydrogen atom, when 



(8.47) 
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Substituting these values into (8.47), we obtain the inequality 




3 

4' 


rtiftel • 


(8,48) 


Since — T = oto» where Aq is the radius of the first Bohr orbit, and 
r = n^aQ, we obtain instead of (8.48) 


n 



(8.49) 


and therefore, in the limit of large quantum numbers, the results 
of qua ntum the ory approach the classical results. 

^^oblei^ Determine the wave function of a freely moving electron in the 

p repres^ntatlonT Write the normalization condition in the /? representation. 

Find the average values of the operators for the momentum and energy of a particle. 
Solve the problem in the one-dimensionai case, and then generalize it to the three- 
dimensional case. 

Solution. Let us choose the x axis along the direction of motion of the electron. The 
wave function of a free electron in the x representation, normalized in terms of t (p'^ —po\ 
will have the form [see (4.81)] 


1 


To transform to the prepresentation, we use Eq. (7,10) in the form 

p 

t (Po, p) = J 'KPo, X')e * * * dx'. 


(8.50) 


obtaining 


9(P0,P) = ^(P — Pi>)- 

The normalization condition has the same form in both a: and prepresentations: 



J ^*(P'm X)dx = 

(Pi,, p) t (po, p) rfp =»(/»; — Po). 


The average value of the operators should be calculated from the equations 

Po-h^P 


M 


Po- Up 

which gives us 


Po "t ^P Po + ^P 

= J (Po, x) Ml)/ (po, x) dx = J dp; j f* (p;, p)M,f (p,, p) dp. 


Po — Ap 


P=Po, E 


Pi 

2mo * 


1 


The problems in this chapter refer to Chapters 6 and 7 as well. 
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In the three-dimensional case, we have 

<P (Po. p) = » (P — po)- 

problem 8.2, jDetermine the probability of the various values of the momentum of a 
partfllu km Uit ground state, the particle being in a one-dimensional square well with 
infinitely high walls. Verify the normalization in the p presentation. 

Solution, Taking the value of the wave function from Chapter 4 

+ (•«) =1/^-7"®’" 


and using Eq, (8,50), we have 


Evaluating the integral and squaring its modulus, we obtain the required probability 
distribution 


./XU , pi 

which satisfies the normalization condition 

00 

J i<f(p)i*rfp=i. 

— 00 

In evaluating the last integral we may use the relation 

+ CO 


J cos ap dp 
b^-p^ 


sin \ a\b 
b 


which should then be differentiated with respect to the parameter b. 


_jgbleniJ^3j|Investigate the motion of a charged particle in a constant and uniform 
electric i^lela g. 

Solution. Tliis problem is solved most simply in the momentum representation. Since, 
according to Eq, (7. 1 1), the potential energy in momentum space can be represented in 
the form 


V=-e%x = — Fx== — Fin4-, 

dp 

the corresponding Schrbdinger equation in momentum space becomes 

The solution of this equation is 






where, because of the continuity of the spectrum, the normalization coefficient was found 
from the condition for 8-function normalization: 

00 

J (E', p) <f (E, p)dp = i (£' - E). 

— 00 
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The wave function in the position space can be determined with the help of Eq. (7,9) 

where 

00 

wj cos 

is the Airy function, which is proportional to the Bessel function of order 1/3 (see 
Chapter 5), and S = j * Examining the asymptotic behavior of the Airy 

function 




for5<0. 

+ for 5>0. 


it IS readily shown that the region of large negative values of jc, where — Fx'> repre- 
sents a potential barrier, whereas the region where £ > — Fx is quasi-classlcal, 
because 


5 == ^ J pdx = J Y2mQ (E + Ex) dx + const = 


= 4- S*"'" + const. 




^oblem 8,4j l An electron moves in a constant and uniform magnetic field. Find the 
time ‘(StertVSHve of the average value of the position and momentum of the electron (in 
other words, generalize the Ehrenfest theorem for the case of motion in a magnetic fleld)u 
Solution, According to Eq. (5.9a), the Hamiltonian of an electron in a magnetic field is 




Choosing the direction of the uniform magnetic field to be along the z axis (//^ rjiL 
the field can be specified by the vector potential 




In order to determine the time rate of change of the electron's position, let us use the 
quantum-mechanical equation of motion 

v = l{Hr— rH}. 


Substituting H, we readily find 


v = 


mo 





(8.51) 


For the time derivative of the x component of the momentum operator, we obtain 


dVx i 

dt n 


{HP,-P,H)= (p, -La,)h,^L 
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Similarly, it can be shown that 


df 


Vv//,. 


dt 


Combining these equations, we have 


dt 


fiv//]. 


(8.52) 


Equations (8,51) and (8,52) constitute the required result. 


8, 5, 

described wii 
(see Chapter 6): 


\ As we know, the behavior of an electron in a metal (x < 0) can be 
a sufficient degree of accuracy by the following potential energy function 



0 , 

Ko>0, 


0 , 

jc> 0. 


Determine the coefficient of reflection from the surface of the metal for electrons located 
inside the metal (,r < 0) in the following cases: (a) E < Vq and (b) E > Vq. Show that, even 
though in case (a) the electrons do penetrate into the region (.r>0), ultimately they return 
back into the metal. Construct a graph of the change of the potential energy and of the 
wave function of the moving electrons. 

Answer. 


a) ForE < Vo, /? = 1, even though > 0) 0. 


b) For E > Fo, 




Vi 


+ /e)‘ 


Hint, In choosing the solution for 0 (outside the metal), only the exponentially 
decreasing solution should be retained In case (a), and only the solution corresponding to 
a wave traveling along the x axis in case (b). 



Chapter 9 


Elementary Theory of Radiation 


A. SPONTANEOUS AND INDUCED TRANSITIONS 

According to classical electrodynamics, an accelerated charge 
is a source of electromagnetic radiation. The amount of energy 
radiated per unit time is given by the well-known equation^ 

(S-l> 

where r = w is the acceleration of the particle. 

If the source of radiation is a one-dimensional harmonic oscil- 
lator 

ji: = acos(i)/. (9.2) 

the frequency of the emitted radiation is the same as the mechanical 
frequency of vibration of the oscillator, and its intensity is propor- 
tional to 

In the case where the motion of a charge is governed by a more 
complicated periodic function x=f{t) with a period = we can 
expand the function f(t) in a Fourier series: 


x="^aff cos (9.2a) 

k 

and treat the radiation as if it were generated by a set of oscillators 
with frequencies (i)/j = /s«), where /!= 1, 2, 3, ... . Radiation will be 
emitted both at the fundamental frequency (k=\) and at harmonics 
fto) of the fundamental frequency. The intensity corresponding to 
the k\h harmonic will be proportional to al. 

Thus, according to classical theory, the radiation of a system 
is completely determined by its mechanical properties. Indeed, the 


^In this chapter the quantum-mechanical averages will be distinguished from time 
averages by writing the latter with the subscript “av ” In accordance with the previous 
notation, quantum-mechanical averages will be indicated by a bar. 
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frequency of the radiation is either equal to or is a multiple of the 
mechanical frequency of oscillation of the system, and the intensity 
of the corresponding harmonic is proportional to the square of the 
amplitude. 

In quantum mechanics, the problem of radiation must be ap- 
proached in a somewhat different manner. According to quantum 
theory, radiation is emitted only when a particle (or a system) 
makes a transition from one energy state to a lower energy state 
(so-called “downward” transition). 

The first quantum treatment of the problem of radiation was 
proposed in 1917 by Einstein. He introduced the coefficients A and 
B (now called the Ems fern coefficients) to char acterize the spontane- 
ous transitions and the induced transitions (that is, transitions due 
to some external effects) of a system from one energy level to 
another; Einstein also obtained an equation relating these two 
coefficients. 

The basic elements of the quantum theory of radiation are the 
following. Suppose one of the electrons of an arbitrary atomic 
system is in the excited state ti with an energy Then there is 
a definite probability Ann' per unit time of a spontaneous transition 
of this electron into a lower energy state n' with an energy 
The transition is accompanied by the emission of a photon with an 
energy ho) = En — En>* If the number of excited atoms is equal toA^n» 
the energy radiated per unit time during spontaneous transitions 
only can be written as 

=NnAnn^noK (9.3) 

When the atoms are subjected to the influence of external electro- 
magnetic radiation, the latter will cause both upward and downward 

induced transitions. The upward transitions 
will, of course, be associated with the ab- 
sorption of photons. 

Adopting the notation introduced by Ein- 
stein, we designate the probabilities of an 
induced transition from level n to n' by Bnn’ 
and from level n' to n by Bn'n> Since the 
number of induced transitions should be 
proportional to the spectral energy density 
P«, of the external radiation, we obtain the 
following equations for the energy radiated 
and absorbed per unit time in induced 
transitions: 

Km 


€n 

Bft'n 


En- 

Fig. 9. 1. Downward tran- 
sitions (spontaneous and 
induced) and upward 
transitions (induced) 



where is the number of atoms in state n'. 


(9.4) 

(9.5) 
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Let us consider the case in which the number of upward and 
downward transitions is the same (see Fig. 9.1): 

“f" ^nPto^n/i' = ^n'Pa)^n'n> 

that is, when a state of thermodynamic equilibrium exist between 
the heated atoms and the light radiated by them (black-body radi- 
ation), which in turn interacts with the atoms. In this state, the 
atoms and the radiated light form a closed system. 

Since, in this case, the energy distribution of the electrons is 
given by the Maxwell distribution 


we obtain 

Dividing by the factor and noting that — we obtain 


K = . (9.8) 

‘^n'n hio/kT , 

Since the spectral energy distribution of black-body radiation is 
completely independent of the specific structure of the atoms or 
molecules involved, Eq. (9.8) is essentially the same as Planck’s 
formula [see (1.42)] 


P.0 






j • 


(9.9) 


omparing (9.8) with (9.9) we find 

= -^3 - (9. 10) 

It is seen from Eq. (9.10) that the probability coefficients of upward 
and downward induced transitions are equal to each other and 
proportional to the coefficient of spontaneous transition There- 
fore, to describe the radiation of atoms or molecules, it is sufficient 
to determine only one of these coefficients. 


B. CALCULATION OF PROBABILITIES OF SPONTANEOUS 
AND INDUCED TRANSITIONS 

In quantum mechanics induced transitions are explained in terms 
of an interaction between the electrons of an atom and external 
electromagnetic radiation. The problem of determining the causes 
of spontaneous transitions was left unexplained by the Schrodinger 
theory. 
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The answer was obtained only after the development of a theory 
of radiation in which quantization of the electromagnetic field 
(second quantization) was used. The general features of the theory 
are outlined below. 

Electrons interact not only with real photons, but also with 
virtual photons (photons which are in an unobservable state) or, as 
they are called, vacuum fluctuations of the electromagnetic field 
(for further details on vacuum fluctuations, see Chapter 22). This 
interaction causes spontaneous transitions. The classical analog 
of the interaction between the electrons and the field of virtual 
photons is the effect of Planck’s radiation damping on a moving 
electron 


^rad — ^ ^ » 

which represents the self-interaction of the electron with its own 
electromagnetic field. Under certain conditions this electromagnetic 
field may detach itself from the electron in the form of electro- 
magnetic radiation. In the language of quantum electrodynamics 
this amounts to a transition of photons from a virtual state into a 
real state. 

The exact expression for the coefficients A and B can be found 
on the basis of quantum electrodynamics and, therefore, problems 
of radiation can be completely solved.^ 

In the present discussion we shall obtain coefficient A by means 
of an appropriate generalization of the results of classical radiation 
theory to the quantum case. It should be emphasized that this 
generalization leads to the same results as the rigorous method of 
second quantization. 

In our derivation we shall use the correspondence principle to 
generalize the classical expression for the radiated energy [Eq. 
(9.1)] to the quantum case. First, we replace the classical variable 
r by the quantum-mechanical quantity 

(9.11) 

In addition, we use an expression for the radiated energy which is 
consistent with the quantum theory: 

= Sngn-fl<»A„„: ( 9 - 12 ) 

where the coefficients gn and gn' characterize the occupancy of 
states n and n' by electrons, since according to the Pauli exclusion 


2 

See also Chapter 29, where the coefficient A is obtained by the methods of quantum 
electrodynamics. 
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principle it is impossible for two electrons to be in the same 
quantum state (for more on the Pauli exclusion principle, see 
Chapter 24). 

Combining Eqs, (9.12) and (9.1) and substituting (9.11) we obtain 

= (9.13) 


Let us note that Eq. (9.13) contains two averages. One is the 
quantum -mechanical average, denoted by a bar, and the other is 
the time average, denoted by the subscript “av.** 

We shall now assume that the electron has only two possible 
states with energies En and Then the wave function can be 
written as 








(9.14) 


The average (over the quantum-mechanical states) value of the 
radius vector is 


f=\C, Wnn + 1 c,. 

where 


The matrix elements 

rn'n = ^ iPx 
form a certain infinite matrix 


(9.15) 

(9.15a) 


(9.16) 


(/-) = 


roo rn 

ru rn rn 

r^o rn rn 


(9.17) 


From (9.16), it follows that this matrix changes into its complex 
conjugate when the rows are replaced by columns, and columns 
by rows 


rnn* rn'rf ' 

Matrices satisfying this condition are called Hermitian or self- 
adjoint matrices. Let us also emphasize that the matrix elements 
(9.16) are independent of time, and therefore substitution of (9.15) 
into (9.13) yields 


gngn'no)Ann' 


(9.18) 
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Here we have used the fact that the time average of a periodic 
function is zero, since 






1 \ ^± 2 / 0 ,/ 


For further analysis of Eq. (9, 18), we must introduce an additional 
assumption, which can be rigorously justified only on the basis of 
quantum electrodynamics. As we already know, quantum mechanics 
deals with stationary processes and, therefore, there is no am- 
biguity in interpreting the quantity = const as the probability 
of finding an electron in the state n. When the emission of radiation 
is present, the coefficients C„ change dis continuously and their 
physical meaning cannot be simply explained within the usual 
formalism of quantum mechanics. We shall, therefore, base our 
conclusions on simple physical considerations, which are rigor- 
ously proved only in quantum mechanics. 

Let us substitute the initial values for the coefficients into 
Eq. (9.18), bearing in mind that the Pauli exclusion principle allows 
transition only in the case when the quantum state n is initially 
occupied, while the quantum state n' is empty. Then setting 


gngn' = I C. n r = I (1 ^ I Q 1^, (9. 19) 

we find that for Cn = l and C,;' = 0 the product 1. Hence 



(9.20) 

D p 4n*e^ j 

^nn' — ^n'n ‘ * 

(9.21) 


(9.22) 


In these equations 

1 rn'n 1"^ = I Xn'n |* + | gn'n P + ! \\ (9.23) 

where 


Xn'n — ^ ^n'X^n 

and so forth. 

Thus, the energy eigenvalues can be used to find the frequency 
of the radiation and the eigenfunctions to find its intensity. Thus, 
all basic classical radiation properties can be completely general- 
ized to the quantum case by means of the Schrodinger equation. 

From the last equation above it is evident that the intensity of 
radiation will be different from zero only for those transitions for 
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which at least one of the matrix elements Unn' and Znn' is non- 
zero. These transitions are called in quantum mechanics the 
allowed transitions. 

It should be noted that in very many quantum-mechanical 
problems it is sufficient to calculate the matrix elements alone and 
thus to set up selection rules, that is, to find the changes in quantum 
numbers that correspond to allowed transitions. From a knowledge 
of the selection rules, one can answer the question of possible 
frequencies of radiation. In the language of classical electro- 
dynamics, the selection rules correspond to a specification of the 
harmonics at which radiation can be emitted by a given system. If 
the matrix elements for a given change (difference) in quantum 
numbers are equal to zero, there will be no radiation at the corre- 
sponding frequencies, and these transitions are said to be forbidden. 

Here, in speaking of forbidden frans^7^ons, we are restricting the 
use of this term to electric dipole transitions. By electric dipole 
transitions we simply mean transitions whose probability depends 
on matrix elements 


rn'n = ^ , d'^x. 

In addition to the dipole transitions, there are also cases of quad- 
rupole transitions, multipole transitions of higher orders, and 
magnetic dipole transitions. The intensity of these transitions turns 
out to be much smaller than that of the allowed dipole transitions. 
As an example, if the intensity of an electric dipole transition is of 
the order 


^ dipole 


— lit 

“ 3 



where a is the linear dimension of the atom, then the intensity of 
electric quadrupole radiation is of the order ^ 


^ dipole * (x) • (9.24) 

For an atom 10 ® cm and 10 ®cmand, therefore, the intensity 
of dipole radiation is 10 times greater than that of quadrupole 
radiation. Nevertheless, quadrupole radiation plays a very important 
part in a number of phenomena. Indeed, if the electric dipole 
transition is forbidden, it is still possible that a weak quadrupole 
radiation will beemitted, which can be detected with a very sensitive 
spectroscope. We note that no dipole radiation occurs in a system 
consisting of particles having the same charge to mass ratio. The 


^This subject is treated more fully for the case of a harmonic oscillator in Chapter 
[see Problem (10.4)]. 
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electric dipole moment of such a system is proportional to the 
coordinate of the center of mass (^1 + ^2) = and 

therefore the derivative of the dipole moment with respect to time 
vanishes. This is the situation that should hold for gravitational 
radiation, since the gravitational charge, or rather the gravitational 
mass, is proportional to the inert mass mu. Therefore, if gravita- 
tional radiation does exist at all, it can only be of quadrupole 
character. Quadrupole radiation is also of importance in nuclear 
physics since the charged particles of the nucleus (protons) have 
the s ame ch arge and mass."^ 

(^Problem 9 . iX Find the probability of quadrupole radiation in the quantum case as a 
gen5WII?!5TTWr8r the classical formula by applying the corresponding principle. 

Solution. In the classical case the intensity of quadrupole radiation is given by the 
equation 


W = 


1 

ISOc'^ 


(Oab)\ 


(9.25) 


where the quadrupole moment is 

Oab = (a, b = 1,2,3). 

To generalize (9.25) to the quantum case, it is necessary to consider that, in quantum 
theory, radiation occurs as a result of a transition of the system from one quantum state 
n to another, n\ Following the procedure similar to the derivation of Eqs. (9.13)-(9. 18), 
we first replace the classical expression for the quadrupole moment Dab by the matrix 
element 

Next, using Eq, (9.3), which relates the intensity of radiation Wn'n io the emission 
probability /I, we obtain 


90 c^n 


ADn'n) ab i^n'n)(ibf 


(9.26) 


where the frequ ency of radiation w is given by Eq. (9. 15a), 

problem 9.2A Find the selection rules for dipole and quadrupole radiation for a parade 
in aJrtBHflfWtytreep potential well. 

Answer, For dipole radiation must be an odd number, and for quadrupole radia- 
tion ^n is an even number. 

Hint. Using the wave funcaon (4,32), it can be shown that the average value of the x 
coordinate is 


X = 



2 * 


Therefore, the matrix elements corresponding to dipole (/= 1) and quadrupole (y = 2) 
radiation should be calculated from the equation 



^See W Heitler, The Quantum Theory of Radiation, 3rd Ed., New York: Oxford Univer- 
sity Press, 1954 
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The Linear Harmonic Oscillator 


The problem of the linear harmonic oscillator is one of the most 
important problems of theoretical physics. It arises in connection 
with the construction of a simple thoery of vibrations in many dif- 
ferent branches of physics (mechanics, classical electrodynamics, 
radio physics, optics, atomic physics, and so on). Most of the new 
theories developed in atomic physics have been 'Hested^’ in a 
number of simple problems, including, in particular, the problem 
of the harmonic oscillator. 

The motion of a complex system can often be analyzed in terms 
of the normal modes of vibrations that are equivalent to the vibra- 
tions of harmonic oscillators. For us the harmonic oscillator 
presents additional interest for methodological reasons because this 
problem has an exact solution and thus provides a simple illustra- 
tion of an application of the Schrodinger equation. The harmonic 
oscillator problem was also very useful in the development of 
quantum field theory (second quantization) and in the analysis of 
the zero-point energy of the electromagnetic field (vacuum fluctua- 
tions). In quantum mechanics, the harmonic oscillator problemhas 
direct applications in regard to the black-body radiation, molecular 
spectra, and the specific heats of diatomic molecules. 

A. THE OSCILLATOR IN THE BOHR THEORY 

We shall first consider the classical theory of the harmonic 
oscillator.* Let us assume that a certain particle of mass rn^ Is 
acted upon by an elastic force 

F=—kx, (10.1) 

where k is the elastic constant. Then the equation of motion for the 
harmonic oscillator can be written in the form 

nl^,x=—kx, ( 10 . 2 ) 

which describes an ordinaiy oscillatory process. 

*In this section we shall be concerned with the case of one-dimensional motion and, for 
simplicity, we shall use the term “harmonic oscillator* rather than “linear harmonic oscil- 
lator.” 
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The solution of this equation has the form 

a: = a COSO)/, (10,3) 

where (d = --==1/ ^ is the angular frequency and a is the ampli- 

X f mo 

tude of oscillation. From Eq, (10,3), it follows, in particular, that 
the acceleration 

w = x— — ao)‘^ cos {fit (10.4) 

differs from zero and, consequently, that the oscillation of a charged 
particle will be accompanied by radiation, the intensity of which 
(that is, the radiant energy) is given by the following equation in 
accordance with Eqs, (10,4) and (2,2): 


w^cl=|“ = (10.5) 

In deriving (10,5), we calculated the average value of cosV from 
the equation 

T 

-i(cosVd/ = i. (10.6) 

6 

We shall now express the intensity of radiation in terms of the 
total energy E = T-{-V of the harmonic oscillator. From the well- 
known equations for the potential energy 

1/ (Jt) = — i F (Jt) dx ^ (10.7) 


and kinetic energy 


r= 


moJtl 

2 




sinW 


( 10 . 8 ) 


of a harmonic oscillator, we find 

£ = !/(;»:) + 7’= = const. (10.9) 


With this equation, we can eliminate the quantity from (10,5), 
obtaining 


^ JmoC* 


(10.10) 


Thus, on the basis of classical theory one can determine both 
the intensity and frequency of the radiation; it is also found that this 
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frequency is the same as the frequency of mechanical vibrations of 
the harmonic oscillator. The energy of the harmonic oscillator, 
according to the classical theory, can have any value in a continuous 
range from zero to infinity. 

Several new features were introduced in the problem of the 
harmonic oscillator by the Bohr quantum theory. For example, 
according to Bohr^s theory, the energy levels had tq be discrete 
and could be found from the quantization rule 

j.p,dx=2-nn. ( 10 . 11 ) 

where 

= S <10-12) 

Let us substitute p^dx — dt == //yoV sin W(// into Eq. (10.11). 

Then, taking into account Eq. (10.9) and integrating over a complete 
period, we find 

= (10.13) 

where the quantum number n = 0, 1, 2, 3, . . . . 

We showed above that, according to Bohr’s theory, the energy 
of a harmonic oscillator can take only discrete values, and radiation 
will be emitted only when the oscillator makes a transition from 
one energy level to another. 

The discovery of a discrete spectrum of energy levels of a 
harmonic oscillator played an important part in the theory of 
black-body radiation. Planck’s law was first obtained under the 
assumption that the harmonic oscillator could radiate and absorb 
light only in the form of discrete quanta of energy nw. 


B. EIGENFUNCTIONS AND EIGENVALUES 
OF THE ENERGY 

In order to determine the behavior of the wave function in the 
harmonic oscillator problem, let us first give a graphical repre- 
sentation (Fig. 10.1) of the dependence of the potential energy V 
on X 


1/ 

V — —^ . 

From the graph, it is seen that inside the potential well, where the 
total energy E of the harmonic oscillator is greater than V (E^V), 
the solutions for will take the form of harmonic functions. Inside 
the potential barrier {E C^V) , the solutions will contain two parts. 
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one exponentially decreasing and the other exponentially increasing 
(see Fig, 10.1). It is clear that the solution of the problem reduces 
to finding the conditions under which there is no exponentially 
increasing solution. Just as in the case of a rectangular potential 
well with infinitely high walls (Chapter 4), such levels exist only 
at certain discrete values of the energy, which we must determine. 



Fig. 10. i. Wave function of the harmonic oscillator for an 
arbitrary value of the energy. 


Since the potential energy V of a harmonic oscillator depends 
only on the x coordinate, the Schrodinger equation can be written as 

^^0. (10.14) 

Setting 

^ □ 1 f//o« ot 2/r 

/i- '’ :v^ /i » 

and introducing a new variable 


^ = - (10.15) 

'o 

we obtain 

r + (10,16) 

where 

(10.17) 


First, let us find the asymptotic behavior of the wave function 
at ' oj, that is, when the constant X is negligible in comparison 
with . Then 




(10.18) 
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We shall seek a solution of this equation in the form 


Since 
we find 


and, consequently, 






(10.19) 

(10.20) 
(10.21) 


Since the wave function must remain finite at coefficient 

Cl must be set equal to zero. Coefficient Q can be taken to be equal 
to unity, since the wave function has not yet been normalized. Thus, 
the asymptotic behavior of the wave function (p is described by the 
function 


= (10.21a) 

We shall seek a solution of the wave function in the general form 

= ( 10 . 22 ) 

which already takes in account the behavior at infinity. Substituting 

(10.22) into (10.16) and considering that 

= [ir 2W + (E* — 1 ) //] 
we obtain the following equations for u : 

u'' — 2W + (K — l)u==0. (10.23) 

Let us look for a solution of this equation in the form of a series 

*^0 

Substituting this expression for u into Eq. (10.23) and collecting 
terms with the same power of 5 , we find 


2^*[(A + 2)(/e+l)&*^,-6*(2*+l-X)] = 0. 

k^O 

Equating the coefficients of to zero, we obtain a recursion 
formula for the coefficients bf, 




(2k + l— X) 
()j+2)(/j+ir 


(10.25) 


This formula relates the coefficients bf^ to and, therefore, the 
series (10.24) will consist of even powers (if the minimum subscript 
k is even) or odd powers (if the minimum subscript k is odd). 
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If the series (10,24) does not terminate at a certain maximum 

power, then beginning with every term is positive and, 

consequently, the series diverges for large values of ; . This leads 
to the second asymptotic solution <l»asym at ;~^co , which we 

disregarded earlier because it diverges.^ Therefore, in order for 
the boundary conditions to be satisfied ((|^ — Oat ^ - .o), we must 
terminate the series (10.24) at a certain ^max==^^- We thus require 


bn-AO, b„,i — 0. 

(10.26) 

From (10.26) and (10.25), we find 


X = 2n+ 1, 

(10.27) 

and, consequently, 


£^ = ^0)(/2 -f- yj 

(10.28) 


where n can assume any positive integral value, including zero. 
These are the only energy values for which the wave function 
vanishes at infinity. 

Comparing this expression with the one obtained from the Bohr 
theory [see (10.13)], we note the appearance of a term called the 
zero -point energy 

= | /io). (10.29) 

Later, we shall show that the existence of the zero-point energy 
is related to the uncertainty principle and thus to the wave properties 
of particles. The zero-point energy does not affect the frequency 
of the radiation, however, since it cancels out in the expression 

for the frequency . 

Let us now find the wave function of the harmonic oscillator. 
The recursion formula (10.25) for thecoefficients when X = 2n -j- 1 
takes the form 

b - b 


^This follows from the fact that at targe k the ratio of tha coefficients 
k/2 IS the same as for the series expansion of the function p- 


^2 


2 

= 0.2 



Therefore, 


^asym 'roo ^ 
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where /e < /? . 
equal to^ 

we obtain 


Setting the coefficient of the highest power 

= (10.30) 


t^/l-2 ^ “ |— » 

6„_4 = 2'-* ,fcO (" = ?) .. (" z. g L and so forth. 


(10.31) 


The power series with a finite number of terms obtained for the 
function u is called the Hermite pol 5 momial 


u = (5) = (2 0" - - (2^0"-^ + 

n(n -\)( n —2) (n—3) ^ 2 ?)'!-* . . -j_ | for odd « (10.32) 

1 6,, for even n 

In particular, 

//o(?)=l. Wi(5) = 2;, //,(;) = 4f''-2, 

= (10.33) 

The Hermite polynomials Hn(^) can be written in closed form"^ 






(10.34) 


■^This coefficient can always be chosen arbitrarily, since the normalization factor of 
the wave function is still undetermined 

“^To show this, we introduce the function v = e ", which satisfies the equation 

v' {- 2^v -- 0 

Differentiating this equation n + 1 times, and using the Leibnitz formula 

2 ' 

we obtain 

j^(n + 2)^ + f 2(rM 1) - 0 

Making the substitution 

i;(n) ^ c~^^w , 


we find that the function w satisfies Eq. (10 35), and thus it is proportional to the Hermite 
polynomial 


^2 ere 

w - 


AH 


n 


The proportionality factor can be found by equating the coefficients of As a 

result, it is found that A^ - (-I)”, from which we obtain Eq (10 34). 
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From (10.32) it is clear that //„(;) satisfies Eq. (10.23) pro- 
vided X==2/H"l 

Hn — min + 2nHr, = 0 (10.35) 

According to (10.22) and (10.32), the solution of the Schrbdinger 
equation for a harmonic oscillator is 

(10.36) 

where $ is related to the coordinate x by Eq. (10.15). The coef- 
ficient can be determined from the normalization condition 

'j 05 

jj = .v„a J e--//„ (;) H„ ($) d-=l. (10.37) 


Substituting the closed form (10.34) for one of the polynomials 
//„(;), we obtain 





d\ 


1 . 


(10.38) 


Using the rule for transferring the derivative of one function to 
another [see (7.17)] (that is, we integrate by parts a times), we 
obtain 







(10.39) 


Noting that from (10.32) 


(V^_ 


HA^) = 2^n\ 


and 


oo 

^ e~^“d^= \ Tz , 

— oo 


(10.40) 


(10.41) 


we find 


that is. 


Cn 


1 

V U A'o 




1 

y-l^nYyTZ 


2 



Hn 



(10.42) 
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In a similar manner, we can easily prove the orthogonality condition 
for the wave functions. To do this, it is enough to represent one of 
the Hermite polynomials, specifically, the one with larger /z, in 
closed form (10.34). The orthogonality condition also follows from 
the general investigation of the Schrddinger equation; it can be 
proved that the eigenfunctions corresponding to the different eigen- 
values are orthogonal. 



Fip, 10.2. The energy eigenvalues and the 
behavior of the corresponding eigenfunctions 
of the harmonic oscillator for small quantum 
numbers (n =- 0, 1, 2) For comparison, the 
classical probability distribution functions 
are indicated by the dotted lines 


In the case of a harmonic oscillator, the orthonormality condi- 
tion is 


-f-OO 

—00 


(10.43) 


For small quantum numbers /i = 0, 1, 2, ... , when 

= «l;i = Ci2?e 2 » 

£.^==-^^ 10 , = — 2)e 2 ' ^ 


(10.44) 
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the probability distribution functions (see Fig, 10.2) differ 

considerably from the corresponding classical probability functions. 
In the classical case the probability of a particle being at a certain 
point is proportional to the amount of time the particle spends there, 

and, consequently, is inversely pro- 
portional to the particle velocity; 
therefore, the classical probability 
is proportional to^ {a^~x^)~^^\y/here 
a is the maximum displacement of 
a classical oscillator from the equi- 
librium position. As we would ex- 
pect, it is only for large quantum 
numbers that there is a relatively 
close agreement, on the average, 
between the quantum and classical 
probabilities (see Fig. 10.3). 


C. ZERO- POINT ENERGY OF THE 
HARMONIC OSCILLATOR AND 
THE UNCERTAINTY PRINCIPLE 

We have seen that in quantum 
mechanics the minimum energy of 
the harmonic oscillator is given by Eq, (10,29) and cannot go to 
zero, whereas in the classical theory or the Bohr theory, the 
minimum energy is equal to zero. 

We shall now show that th e existence of the zero- point ener gy 
(10.29) in the Schrbdinger theory fs.. as mentioned above, very 
closely re^^ated to tha_uncertaintv principle ( 8.11), For the'ca^e 
of a harmonic oscillator the uncertainty principle becomes 



Fip;- 10.3. Comparison of quantum 
and classical results for the oscil- 
lator in the region of large quantum 
numbers (here n - 10). 


'’This can be easily shown on the basis of the following simple qualitative consider- 
ations. The probability of finding a particle at a particular point can be roughly charac- 
terized by the absolute value of the reciprocal of its velocity, since the time a particle 
spends in a region will be greater in regions where the velocity is smaller than in regions 
where it is greater. Consequently, the probability of finding a particle in a region with 
larger velocities will be smaller than that for a region with smaller velocities. In the 
case of a harmonic oscillator, we have from Eq. (10.3) x/a = cos cot and x/ coa - —sin cot. 
Taking the square of both equations and adding them, we get 

and, therefore, 

1^1 



THE LINEAR HARMONIC OSCILLATOR 


59 


Here we have replaced (Aa:)^ by and (Apf by p\ This is justified 
by the fact that the wave functions are real and are either even or 
odd. Indeed, since the expression = is odd, we have 

x = { ^'^'x^dx = 0. 


Hence 


Similarly, using the boundary conditions at infinity, we find 


^ f I ^ ^ f ‘2 A 


that is. 


(Apf = /}'* — (/))’ = ju'. 


Substituting the value of from (10.45) into the equation for 
the total energy 

E=.H=. (10.46) 

we obtain 





2 


(10.47) 


From this it is seen that the energy E cannot vanish at any value 
of(A:‘^). Indeed, although the second term vanishes for (;c^) = 0, the 
first term becomes infinite. Conversely, when (.^^'^) = oo, the first 
term vanishes and the second becomes infinite. 

Thus, the fact that differs from zero is directly connected 
with the uncertainty relation (10.45) or, in other words, with the 
fact that it is impossible to calculate exactly the position and 
momentum simultaneously. 

Let us find that value of (a:^) at which Eq. (10.47) has a mini- 
mum, Setting the derivative of this function with respect to (a:*) 
equal to zero, we obtain 
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Substituting this into (10.47), we have 


fliji I Um flea 

■ T" > T" “9" • 


(10.48) 


Hence, ^min=Y» which is exactly the same as the value for Eq found 

from the wave theory [see (10.29)]. 

The existence of a finite zero-point energy of the harmonic 
oscillator is one of the most characteristic manifestations of the 
wave properties of particles. Thus, the experimental verification 
of the zero-point vibrations was of great significance for quantum 
mechanics. The zero-point energy Eq was first observed experi- 
mentally in the scattering of x-rays by crystals at low temperatures. 
If there were no lattice vibrations at low temperatures (£(, = 0), as 
predicted, for example, by the Bohr theory, there would be no inter- 
action between the x-rays and the crystal lattice, and consequently 
no scattering would occur. K, on the other hand, the minimum 
energy were different from zero (Eq-AO) for T 0, the scattering 
cross section at low temperatures should approach a finite limit. 
E;q)erlments have confirmed that the second situation corresponds 
to the true state of affairs and, therefore, the conclusions of the 
Schrbdinger wave theory are justified. 


D. SELECTION RULES. INTENSITY OF RADIATION 

Let us consider the problem of radiation from the harmonic 
oscillator on the basis of wave mechanics. For this purpose, as 
was Indicated in Chapter 9 [see (9.22)], we must calculate the 
matrix elements 


Xn'n= j| (10.49) 

where <]^^is given by Eq. (10.36). 

As a preliminary step, we shall derive a recurrence relation 
for the Hermite polynomials, which will be necessary in our further 
discussion. From the definition (10.32) of the Hermite polynomials 
//„(5), we find 

Hn (5) = 2n I (25)'- - (25)'-’ + ...] = (5), 


from which It follows that 


H'n (5) = 2nH'„. , (5) = 2rt • 2 (n - 1) (5). 


(10.50) 
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Next, substituting these equations for the derivatives into Eq. 
(10,35) and replacing U we obtain a recurrence relation for 

the Hermite polynomials 

= ($). (10.51) 

By means of this formula, the matrix element (10.49) can be reduced 
to the form 

4 00 -foo 

= xlC„Cn' (?) //„ (?) d? + n' J (?)//„(?)d?) . 

— CO — CO 


In terms of the wave functions tj), we obtain 


C c 


Since the functions ^l^nSire orthonormal, we have 

Xn: =Xo {y 8„'+l, „ + (« + 1) S/.'-!,. } . 


(10.53) 


From this expression it follows that the only nonvanishing matrix 
elements are those for which n' — n — 1 or n' = n~\-\; therefore, 
the selection rules for the quantum number n are expressed by the 
equation 

An = n — n' = ±: 1, (10.54) 

which indicates that only transitions between neighboring levels are 
possible. 

For the wave function as given by Eq. (10.42), we obtain from 
Eq. (10.53) 


^n-l, n — ^0 .7 > 

y y 1 / ^ 

— '''0 y 2 * 


(10.55) 


where Xy is determined from Eq. (10,15), For the frequency of the 
radiation, we obtain exactly the same expression as in the classical 
theory: 


"rt. n-t = 




n 


( 0 . 


(10.56) 
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The energy levels and allowed transitions are shown in Fig. 
10.4. 

Since spontaneous emission is possible only when transitions 
occur from higher to lower energy levels it follows from 

(9.22) that the intensity of radiation 



CO 


CO 


U) 



-Ejf, 

■ e,/h 

•Eo/f' 


of the harmonic oscillator W 
is 


n, rt-l 


TT7 ^ 6 


~~ 3 moc* ■ 


•£«). 


(10.57) 


I' lp. 10 4. Allowed transitions of the 
harmonic oscillator 


Comparing this equation with Eq. 
(10.10), which was obtained from the 
classical theory, we see that for 
large quantum numbers (/i^^> 1), when 
En \ E^^, both equations yield practic- 
ally the same result. Transitions to higher energy levels n-^n + 1 
are possible in the case of induced transitions. The occurrence of 
spontaneous upward transitions is also possible under the condition 
that the energy loss in the harmonic oscillator is compensated by 
the simultaneous liberation of a large amount of energy, as for 
example, in transitions of atomic electrons (see Chapter 12, 
spectra of diatomic molecules). 

Find the eigenvalues of the harmonic oscillator using the WKB method, 
T^^ordlng to (10.14), the wave function for the harmonic oscillator is 


where 


2moB 




^ / R2 = ^ 

According to Eq. (5.75), the eigenvalues are determined from the equation 

Va/? 


^ a — [i-.v* dx — TZ j . 


— > «/(i 

Evaluaung this integral, we find that the energy eigenvalues are the same as in (10.28); 
that is, the zero-point energy is also present. 


yfroblemlOj^ Construct the theory of the harmonic oscillator in the prepresentation 
(for tn^ on^-ciimensional case). Find the equation of motion, the eigenvalues and the 
eigenfunctions. 

^3 

Solution. Since x” = — in the p representation, we can write the Schrodinger 

equation as 


2wo 


+ - 


dp- 


■)<p 


(;») = 0 ; 
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that is, transforming the wave equation for the harmonic oscillator from the a' representa- 
tion to the representation and introducing the new parameters 


X 


'in 

fl(0 * 


P_ 

Po* 


where 


Po~ V 


we find that the wave equation changes identically into itself 

(the prime indicates the derivative with respect to t)). Using the solutions (10.28) and 
(10.42), we can write in the p representation 


and 


fn (P) = 


1 


e 


2 


”■) 

Pc’ 1 1 



It is easily verified that this wave function satisfies the normalization condition 


t^roblem 10.y Find the eigenfunctions and energy spectrum of an electron (e = — e© < 0) 
movtflg*trfW?OfflS&nt, uniform magnetic field. Show that, according to the quantum theory, 
the “electron gas“ must be diamagnetic. 

Solution. Let the magnetic field be directed along the z axis(//v = I Jy = 0,//^=e5^). 
We can then write for the components of the vector potential = 0. 

The motion of an electron is described by the Schrodinger equation (see ^so Problem 8.4) 




Ip 

dy'^X 27no?' 


^ = 0. 


Since the coordinates y and z do not appear explicitly in this equation, we shall look 
for a solution in the form 


For the function /(a), we obtain the equation 


d^f 

dx^ 



2moC^ 


(,v+5)^)/=0, 


where 




r^ki 
2mo * 


h = 


ctiki 


It is easily seen that this equation has the same form as Eq. (10.14) for the harmonic 
oscillator. 

Consequently, we can use solutions (10.27) and (10,36) to determine the eigenfunctions 
and eigenvalues. We thus find 


~'L 




e 


(10.58) 
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where (;) is the Hermite polynomial, J normalization co- 


efficient, o : 


2moC 


is the frequency of Larmor precession, and 




For the eigenvalues, we have 






n^k\ 




The last term in this equation is simply the kinetic energy of a free electron moving 
along the z axis, and is of no special interest. 

The first term 




(10.59) 


where fio is the Bohr magneton, corresponds to the additional energy acquired by an 
electron in the magnetic field. ITils additional term represents the energy of electron 
motion in the xy plane, which is perpendicular to the magnetic field. 

This conclusion is in agreement with classical theory, according to which an electron 
placed in a magnetic field processes with the Larmor frequency o in a plane perpen- 
dicular to the magnetic field. 

In the classical theory, however, the energy of an electron in a magnetic field is 
determined entirely by its unquantized kinetic energy, llierefore, according to the clas- 
sical theory, an electron gas generally exhibits no diamagnetic properties. 

In the quantum theory the energy (10.59)canbe interpreted in terms of the appearance 
of an additional magnetic moment p of an electron, which makes the following contribu- 
tion to the energy: 


/; magn = _ |x^ ( 10 . 6 O) 

Comparing (10.60) with (10,59), we find 


= (2n+ I). 


Since the number 2n -f I assumes only positive values (/z =:0, 1, 2, 3, . . .), the addi- 
tional moment of an electron will be directed along the negative z axis. This naturally 
leads to the diamagnetism of free electrons in a metal. 

It should be noted that in quantum mechanics, solution (10.58) corresponds to harmonic 
vibrations along the z axis along, whereas in classical theory the circular trajectory 
means that there are harmonic vibrations along both .v andy axes, with a phase difference 

of-^. The reason for this is that the energy is independent of the momentum Con- 
sequently, degeneracy occurs and the solution for a given energy has the form 

^ » ( 10 . 6 1 ) 

where the coefficients are arbitrary amplitudes satisfying the normalization condition 

In classical theory there is also an indeterminacy, since the center of the circular trajec- 
tory is not uniquely specified. 

The general solution (10.61) corresponds to a set of circular trajectories having 
different centers located along the y axis. 
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Concluding the above discussion, let us note that the solution (10.61; includes the 
harmonic oscillations along both x and ,v axes. This can be seen by examining the expres- 
sion for the energy [Eq. (10.59)] which represents a sum of the energies of two harmonic 
oscillators (note that ixo,^ = o/j) 




jlem 10.4^ Show that the matrix element of the product of two operators M (a) and 
v), wRTcR' Ifrtndependent of quantum numbers, is equal to the sum of the products of 
the matrix elements of these operators, that is. 




(10.62) 


Solution. Writing 

(MN)„,„ = ^ ^ cj/*, (.v) M (x) i (X — X') N (x') i|/„ (a-') dx dx' 
and using the relation (see Chapter 4) 


» (A — A') = 2 (A') (a), 

k 


together with the fact that the operators M (,v)andN (jir’)are independent of quantum num- 
bers, we readily prove Eq. (10.62). 


)lem 


problem 10. 5j Find the selection rules for quadrupole radiation emitted by the 
haralOfUfi '^SHTator. Find the intensity of spontaneous quadrupole radiation and compare 
It with the intensity of dipole radiation, c5btain Eq, (9.24), which relates the intensity 
of quadrupole radiaaon to that of dipole radiation. 

Solution. In Problem 9,1 we found that the quadrupole radiation is proportional 
to the matrix element, which according the the preceding problem can be written as 


n'k^ k 


Substituting the values of from Eq, (10.55), we find the following nonvanishing matrix 
elements of the quadrupole radiation: 

(•«’)«-«, n = /n(n— I) ; 

/(«TW+T), {x‘)n,n=-xl[n + ^'). (10.63) 

That is, the selection rules for the quadrupole radiation of the oscillator are 

A/i = 0, H[: 2. 

7Tie probability of spontaneous emission (n' = n — 2, A/i =. 2) is calculated from Eq. 
(9,26). For large quantum numbers when E = we have 

\;^quad — JA 

15 • 


According to (10.57), the intensity of dipole radiation is 


u^pole_ _2 e^i^^E 
3 OToC* 
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Using the last two equations^ we obtain 


W'dlpole“ 5 i e j “ ~ U / ’ 


where a* = = is the square of the classical amplitude of oscillations. This relation is 

//io<o* 

in agreement with Eq, (9.24). 

^Problem lO.y Show that the center of a wave packet composed of the solutions for 
the iTailllUllit! UlfCillator moves according to the laws of classical mechanics. 

Show that this wave packet does not spread with time. Obtain the transition to the 
quasi-classical case (n > v >> 1, where 2v -f* Hs the number of waves in the wave packet). 

Solution, Let us assume, for simplicity, that the wave packet is composed of 2v -f- 1 
eigenfunctions of equal amplitudes 


0 


1 

K2r+r 



where tj'/i+y eigenfunctions of the harmonic oscillator, and « is the mechanical 

frequency of vibration. 

The coordinate x of the center of the wave packet is given by 
^ j t) (x, t) dx = 

V 

""2T:Fr 2 = 

with 


v-1 


2v + 


fL. Vi/'2( ^ + y+i) 
LV 2 n+l ' 


/=-v 


where a = 



n{2n+\) 

rrioio 


is the classical amplitude of oscillations. 


It follows from the last two equations that x obeys the classical equation of motion 
for the harmonic oscillator 


x-\- (d^x~0. 

Evaluating .v® in a similar manner, we obtain (A,v)*=]c* — x* for the mean-square 
deviation 


(&xy = ( c« - ^ j 4- ( B« - J cos 2<oy. 


where 


*-2 


C* = -- = 


2 2v-f 1 


i_ v/(n + y+l)(« + ; + 2 ) 


n + i 


Consequently, (Aa:)* oscillates about a certain mean value and, therefore, does not spread 
with time. 
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From this It follows that the larger the number of waves^the smaller will be the width 
of the packet, and finally, for (2v + 1) > 1 , the width tends to zero. 



(Ihaplc'r I I 


General Theory of Motion of o Particle in 
0 Centrally Symmetric Field 


The problem of the motion of a particle in a central field of 
force (a field in which the potential depends on the distance alone, 
and not on the angles) is one of the standard problems of quantum 
mechanics. This problem provides a basis for the quantum theory 
of the rotator, which is of considerable importance in connection 
with the spectra of diatomic molecules, the theory of the hydrogen 
atom, the nonrelativistic theory of the deuteron, and so on. It is 
worth noting that in a central field of force the dependence of the 
wave function on the angles <> and cp is completely unrelated to the 
specific form of the potential energy. Accordingly, the spherical 
harmonics are of general validity; they are applicable to any 
centrally symmetric field. The classical analog of the quantum- 
mechanical investigation of the ai^ular parts of the wave function 
is the derivation of the law of conservation of angular momentum 
in a central field of force. This law is also independent of the 
specific form of the potential energy. 


A. SCHRODINGER’S EQUATION IN SPHERICAL 
COORDINATES 


The problem of the motion of a particle in a central field of 
force 

F^F{r)^, ( 11 . 1 ) 


is usually solved in the spherical coordinates r, ^ and cp, which are 
related to the Cartesian coordinates (see Fig. ll.l)by the equations 


ji[: = pcoscp, y==psincp, z = rcos^, p=:rsinO. (11.2) 


We shall now write the Schrddinger equation in spherical co- 
ordinates. 

First, using the general definition of the potential energy V as 
a quantity whose negative gradient is equal to the force F, we have 


dV = —{F^dr). 


( 11 . 3 ) 
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For the case of central forces (11.1), we obtain 

dV= — -yixdx + ydy + zdz) = — Fdr, (11.3a) 

and hence 

r 

= - J f (r)rfr, (11.4) 

CO 

where the lower limit of integration is chosen in accordance with 
the convention that V (r) vanishes at infinity. 

In particular, if the central forces are due to Coulomb inter- 
action 


F=~ 


r« 




where 2^*0 is the nuclear charge (e = — is the charge of an electron 
moving around the nucleus), we obtain for the potential energy 



00 


(11.5a) 


Now let us find the expression 
for the Laplacian y* in spherical 
coordinates. Using the identity 

V- V >P, (11.6) 

we shaU first find the components 
of a vector 


(11.7) 


in spherical coordinates. 

Bearing in mind that a gradient 
expresses the spatial rate of a 
change of scalar field in a certain 

direction ^1^ = •^), we 

obtain, in accordance with Fig. 11.1, 




dr ■■ 




_ d^ 
' rd% 

d^ 


( 11 . 8 ) 




r sin 1 



Fi|5. 11.1. Spherical coordinates. The volume 
element in spherical coordinates. 


Let us use the definition of divergence 


V ‘ B= lim 

5—0 


(BdS) 

s 

d^x 


d^x 




(11.9) 
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where d^x is the volume element in spherical coordinates 

d^x = r* sin » dr dh d<f (11. 10) 

(xi stands for the coordinates r, ft and <p), and dS/ stands for elementary areas perpendic- 
ular to the directions dr, rdft, and pdcp respectively: 



dS^ = r* sin ft dft dtp, 
dSfl = r sin ft dr dtp, 
dS^ = r dr d^. 

(11.11) 

With the help of Eq, (11.8), we obtain 



y.fl = ^ r» sin «rfW<p) dr + 

+ 'P) "* + ( r sinU ' 

(11.12) 

from which 
ordinates 

we readUy find the expression for the Laplace operator in spherical co- 


r* dr y dr] r* | sin ft dft \ dftj sin® ft' dtp®] ' 

(11.13) 

Setting in Eq. (11.13) 


and 

r‘ drV ar/~ 

(11.14) 


I d f . s,d\ . 1 

sinftdftr^^^pj 1 sin* ft dtp® 

(11.15) 

we have 

V’=V* + J VJ,,. 

(11.16) 

so that the Schrbdinger equation (4.8) takes the form 




(11.17) 

where 



kHr) = ^~^,^[E-V(r)] 

(11.18) 


is, according to Eq. (11.4), a function of the radius r only. 


B. SEPARATION OF VARIABLES. EIGENFUNCTIONS 

We shall solve Eq. (11.17) by the method of separation of vari- 
ables. Let us represent the desired function as a product of the 
radial and angular parts 


^ = cp). 


(11.19) 
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Multiplying the original equation by , we obtain 

+ = ( 11 . 20 ) 

Since the left-hand side depends only on r and the right-hand side 
only on the angles & and <p, this equation can be satisfied only if 
both the left- and right-hand sides are separately equal to a con- 
stant X, called the separation constant. 

We, therefore, obtain the following equations for the radial and 
angular parts, respectively: 

V?l? + (fe’-p)/? = 0, (11.21) 

+ = (11.22) 

The important point to note is that the angular part of the wave 
function does not contain the variable r and is independent of the 
specific form of the potential energy V. Consequently, as we men- 
tioned at the beginning of this chapter, the angular solution will be 
valid for any central force. 

Using the method of separation of variables for the angular 
part alone, we set 

K = 0(^)«>(cp), (11.23) 

and thus obtain the following equations for the functions 9 and<&: 

'’W+{‘-5Sr)9='>- 

VJO-f m’'l> = 0. 

Here m’ is the separation constant and we have used the following 
notation: 


(11.24) 

(11.25) 


VJ: 


1 ^ 
sin 9 rf9 

VI == 


(sind/^), 




(11.26) 

(11.27) 


where partial derivatives are replaced by total derivatives, since 
each of the functions 0 and ^ depends only on a single variable. 

Thus, we have obtained three equations— (11.21), (11.24) and 
(11.25)— for the energy eigenvalues Ei and the corresponding eigen- 
functions . The last equation contains only a single parameter 
m\ whereas the first and second contain two parameters each. 

Since the solution of one equation yields the eigenvalues for only 
one parameter, we must begin the solution of the entire problem by 
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solving (11.25); then, knowing w ^^roceed to solve (11.24) and 
finally (11.21). 

To find the normalization constant, we use the relation 


00 It l-K. 

f R^Rr'-dr f e*esin»rf» ^ 

b b b 

which shows that each of the functions can be normalized separately: 


oo 


^ R*Rr^dr=\, 
b 

(11.28) 

JC 

^ e'-esinOrfB^ 1, 

n 

(11.29) 

u 

2it: 

f I. 

(11.30) 


0 


The particular solution for the azimuthal function [see Eq. 
(11.25)] can be written in two ways: 


(11.31) 

or 

(I) = A cos (mcp cpn). (11.32) 

Each of these solutions has a different physical interpretation. 
The solution (11.31) represents a wave traveling around the cir- 
cumference of a circle and corresponds, for example, to uniform 
circular motion of an electron. On the other hand, the solution 
(11.32) is associated with standing waves and corresponds, for 
example, to oscillations of an electron along an arc. In order for 
the function fl> to describe the motion of an electron around the 
nucleus, it must have the form of traveling waves (11.31). More- 
over, since a solution proportional to e can be obtained from 
the first solution by replacing (m) by (- m), we can take, without 
any loss of generality. 


= (11.33) 

where the quantity m assumes both positive and negative values. 

Since the wave function must be unique (see Chapter 4, Section 
B), the function <I?(.p) must be periodic with a period 


(1^ ( cp) = cD ((p 2 t:). 


(11.34) 


It follows that 
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and therefore the quantity tn, which is called the magnetic quantum 
number, assumes only integral values 

m = 0, zbl, ±2, ii:3» .... (11.35) 

From the normalization condition (11.30), we find It 

is readily shown by direct calculation that the functions 

(11.36) 

satisfy the condition of orthonormality 


2k 

b 

Since we now know the eigenvalues ni and the wave function 
associated with the azimuthal angle <p , we can proceed to solve 
Eq. (11.24). Introducing the new variable 


x = cosO (11.37) 

and denoting derivatives with respect to .v by primes, Eq. (11.24) 
becomes 


1(1— x*)eT -!-(>■ — (11.38) 

It can be seen that (11.38) has singular points at a — 1: 1, that is, 
points at which one of the coefficients of B becomes infinite. To 
eliminate this divergence, we shall look for a solution 0 in the 
form 


0=(1 (11.39) 

Substituting (11.39) into (11.38) and dividing all the terms by 
(1 , we obtain 

( 1 — JC’) H" _ 2Ar (s + 1 ) h' + [x — s* — s + n = 0. (11.40) 

We eliminate the singularity in the last term by setting 


s = zLni, 


Since the fundamental equation for H depends only on ni\ the 
solutions corresponding to these two values of s both satisfy the 
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same equation, and, consequently, there must be a simple linear 
relationship between them: 

0(m) = i4 0( — m). (11.41) 

It Is, therefore, sufficient to solve Eq. (11.40) for 

s=m^0. (11.42) 

With the help of Eq. (11.41), the solution can be automatically 
extended to the negative values of m. 

Underthe condition (11.42), Eq. (11.40) becomes 

if — 2x{m+\)u! + (k — m(m+\))u = 0. (11 .43) 

Since this equation has no singularities, its solution may be 
represented as a polynomial 


Substitution of this polynomial into Eq. (11,43) gives 

k^O 

Collecting the terms with the same powers of x, we obtain 

2{(A + 2)(*+l)a*„+[X-(* + m)(A + m+l)]a*}x* = 0. 

This yields a recurrence relation 

(^ + 2)(A:+l)a,^, = ^[X-~(^ + m)(^ + /n+l)]a„ (11.45) 

which gives the relationship between the coefficients of the series 
(11.44). Since the coefficients are e3q)ressed in terms of and 
thus only alternate terms are related, the function u will be either 
even or odd depending on whether the main term is even or odd. 

We require that the series (11.44) terminate at some maximum 
power k ~ g, so that 


Then from (11.45) we obtain 


where 


X = ((7-f m)(7 + m+ 1), 

< 7 = 0 , 1 . 2 , 3 ,... 


(11.46) 

(11.47) 
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(that is, q is equal to the power at which the series is teirminated). 
Introducing the orbital angular momentum quantum number l 


l = q + m, (11.48) 

we find that, just like the numbers q and m, this number can 
assume only positive integral values (Including zero), that is, 

1 = 0, 1, 2, 3 (11.49) 

and from (11.48) it follows that 

l^m. (11.50) 

According to (11.48) and (11.46), we have 

X = l(/ + 1), (11.51) 

and, therefore, Eq. (11.40) can be reduced to the form 

(l—x’)M" — 2;t(m+ !)«' + [((/ + l) — m(m+l)]a = 0, (11.52) 

where 


u = « + a, _m_ 2 jr' 2 + . . . + (11.53) 

Instead of determining the relationship between the coefficients a, 
and by means of the recurrence relation (11.45), let us represent 
the solution (11.53) in a closed form. For this, we introduce the 
function 


t; = (A;*-l)', (11.54) 

satisfying the equation 

{\—x'*)xf -{-2xlv = 0, (11.55) 

which is easily obtained by taking the first derivative of v with 
respect to ;c. Differentiating Eq. (11.55) with the help of Leibnitz’s 
rule [see (10.34a)] and setting 




dx^ + ' 


(x’ -!)' = «„ 


(11.56) 


we obtain the following equation for the function «, : 

(1 — x^) u" — 2x (tn l)«I4-(i + »i+ 1)(^ ~m) «i=0. 


(11.57) 
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We note that this equation is exactly the same as the differential 
equation (11.52) for the function w. Consequently, functions u and 
Ui must be proportional to each other 

t/ = const • Ui. (11.58) 

Since the normalization coefficient of the function 0 has not yet 
been determined, we can set this proportionality constant equal 

to in order to make the solution (11.58) for m = 0 identical with 

the Legendre polynomial 


We thus obtain 




1 — 1 )' 


‘2‘/l 




“ 5'/! ^ 


(11.59) 


from which, with the help of Eq. (11.39), we find the following 
expression for the function f>: 

er = crP”(x). (11.60) 

Here PT is an associated Legendre polynomial defined by the 
equation 

( 11 . 61 ) 

andcr is the normalization coefficient. 

Although (11,61) was obtained for positive values of triy it 
can also be extended to include the negative values of m by using 
the well-known relation* 

P«(x) = (-l)'»iJ^P-'»(x). (11.62) 


*To prove Eq. (11 62), we put it in the following form with the help of (11.61): 

^ = (/+|m|)i (x^-1)'. (11 63) 

Since P/” and Pi must be linearly related to each other [see (11 41 ) 1 , it is sufficient for 
us to show that the coefficients of the leading power of x on both sides of Eq. (11.63) are 
equal to each other, that is. 




fl+Iml 21 


dx 


Z+lml 


■ ^21 


(/+ |ml)» . . a: 


This IS easily shown shown since 


d'^x" 

dx* 


n' 


^n-k 


for 


k ^ n , 


(n~ky 

0 


for k > n . 


(11 64) 
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From (11.61) and (11.62), it follows that the range of variation of 
the quantum number m is 


m = 0 . ± 1 . ±-2 ± 1 ; 


since for lm|> / the solution P{" vanishes. ^ 

The coefficient in (11.60)canbefoimdfrom the normalization 
condition 

n 1 

f er sin » = f er (•*) er (■^) dx=i. 

& -I 


Substituting the solution (11.60) and using (11.62), we obtain 


(-I)” (/+»!)! 

( 2 ' /!)“ (/ -- III)'. ' ‘ 



1 . 


Transferring the derivative from the second factor in the integrand 
to the first factor I -j- m times (that is, expanding the integral by 
parts /-|-m times), we obtain 


I (l + m)l 
(2'/!)'' (/— /fl)l 


1C«|« 




( I — x^)‘ (x^—l)‘dx=l. 


Using the equation (see also (11.64)] 

dx‘‘' \ 0 (rt<2/), 

and 


we find 


^ (1 — x^)^ dx= 


(/!)» 2‘'+‘ 
(2<+l)! ’ 



(2/+1) (/-m)l 
2 (i+m)! • 


(11.65) 


^Because of the linear relationship between P/” and P/ many authors present the 
solution for the function 0 in the form 

e - (X) . 

We shall not use this form, since in this case the recurrence relation between the asso- 
ciated Legendre polynomials is more complicated than for the solution (11 60) (the recur- 
rence relation is important in connection with the selection rules and the solution of the 
Dirac equation). 
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Then 


( 11 . 66 ) 

For the spherical harmonic YT (ft, f), which satisfies Eq. (1.22), 
the relations (11.23), (11.36) and (11.66) yield 

yt (ft. <p) = 

The orthonormality condition for the spherical harmonics takes 
the form^ 

^ (Y?')* YTdQ = htv Km- (11.68) 

After having obtained the eigenvalues of the parameters m and 
X, we may proceed to the solution of Eq. (11.21) for the radial part 
in which there remains only one unknown parameter. The radial 
solution, however, can be obtained only if the form of the potential 
energy V (r)ia specified, and, therefore, we shall leave this question 
aside until we come to consider various specific forms of V {r) in 
the following chapters. 


C. PHYSICAL MEANING OF THE QUANTUM NUMBERS 
/ AND m. ANGULAR MOMENTUM 


We have found that the quantiun number I characterizes the 
eigenvalue X = /(/ -f- 1 ) of the operator --v8, » (see (11.22) and (11.51)], 
which is a part of the Hamiltonian 


H = — 






2mo 


2mor* 


--\-V(r). (11.69) 


Comparing this Hamiltonian with the classical Hamiltonian function 


//= K(r) = 


Pi 




2//io 


2mor* 


■V(r), 


(11.70) 


^To prove the orthonormality condition (11 68), we substitute the expression (11.67) for 
the spherical harmonics into (11 68). Integrating over the angle tp, we can readily show that 


J_ 

277 



277 , 

)<P' 


Integrating the Legendre polynomials over the angled, we can set m' - m. Then without 
loss of generality, we can take ^ The case l' = I was considered above in connec- 
tion with the determination of the normalization coefficient. In similar fashion, it can be 
readily shown that for / <l the integral (11 68) vanishes as a result of transferring the 
derivative from the function with subscript I to the function with subscript I . 
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where p,=mef. and we see that the operator (— 

corresponds to the square of the angular momentum L’ in the 
classical case, and the operator ( — to the square of the radial 
momentum p*. 

Let us investigate these analogs in more detail. As we know 
from classical mechanics, the angular momentum L is defined as 

L= rxp. (11.71) 


If external forces F exert a torque M = rx F,the time rate of 
change of L is given by 



(11.72) 


In the case of central forces (fllr), no torque M is exerted and, 
consequently. 


L = const. 


In classical mechanics this result is known as the law of conserva- 
tion of angular momentum; it appears in Kepler’s theory of plane- 
tary motion as the law of conservation of areal velocities (that is, 
the law of areas). 

To generalize the classical expression for the angular momen- 
tum to the quantum case, we replace the classical momentum p In 

(11.71) by the momentum operator p=: jy. We then obtain 

L = rxp=jrxv 
or 

= — ZPr 

L^ = Arpy — yp^. 

We note that the angular momentum operators Ly andL^ do not 
commute with each other. For example, by direct calculation of 
the commutation relation between L;,andLy, we find 

(yp^ zp^) (zpj, -^^Pz) (zPz ■*^Pz) (yPz — Zpj,). 

Using the commutation relation between the momenta and the 
corresponding coordinates [see (7.7) and (7.8)], we find 


(11.73) 

(11.74) 


— LyL^ = — in (yp^ — xpy) = ini^. (11.75) 

Similarly, it can be shown that 

byl^z * l^zl^y — — 

LzLjp — — LjifLz — 


(11.76) 
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To express the square of the angular momentum operator 


L* = LH-4 + U (11.77) 

in spherical coordinates, we must first determine the components 
L^. Ly and L, in these coordinates. Using the relations (11.2) between 
Cartesian and spherical coordinates, we have 


dx I 

^ ^ dx~d^ ■ 


0^ 


~dy ““ 

= r COS 0 cos cp ~\~r cos ^ sin cp — /* sin d ; 

dz * 


xz , yz 

p dx~^ ^ dy ^ 


()<|/ ^ 4 ' _L ^' 1 ' 

^<p dx ^ dy d<p dz dcp 

= — AsinOsiny ^+/-sindcos.p|^=r— 1/ +a: ^ 


(11.78) 

(11.79) 


Multiplying Eq. (11.78) by y and Eq. (11.79) by(—-^), adding these 
products, and remembering that we obtain the relation 

= (11.80) 

Now let us multiply Eq, (11.78) by and Eq, (11.79) by 
Then, proceeding in the same way as before, we obtain 

^^2 ~ ^ 2 = “ {*'” 'p 2 + 2} • 

Using Eqs. (11.79) and Eqs. (11.74), we find 


-T{®‘"'p2 + “®7Cot&|}. 

(11.82) 

j{coscp^-sin ?cotft0. 

(11.83) 

11 
i d<f* 

(11.84) 


Introducing the variable a; = cos & (which should not be confused with 
the Cartesian coordinate x), Eqs. (11.82) and (11.83) can be written 
in the form 
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To determine the effect of these operators on the spherical 
harmonics, let us take advantage of the fact that a particular 
spherical harmonic can be represented either as (11.67) or as 


Kr = (— O'" ]/' — P-™ (cos »)<?""» (11.86) 

Operating directly with on the spherical harmonic, we find 

L,YT==nmYT. (11.87) 


From this it follows that the quantum number tn characterizes the 
z component of the angular momentum. 

. To determine the effect of the operator Lj^ + iLy on the spherical 
harmonic Vj” , we use the expression (11.67), and for the effect of 
the operator — we use the equivalent e3q)ression (11.86). Then 
from the equation 






X d 

df 



l±m 

= qr e“f ‘“i'* ( 1 — a:*) * ' 




it follows that 

(L^ ± iL,) Fr == - ^ /(< + 1 ± in) (I rp in) Y?-- (11.88) 

Equations (11.87) and (11.88) yield 

IjyT = [ Y (L* + ‘ Ly) (h, - i Lj,) + j (L, - ' ^y) X 

X (L, + i L,) + u] YT = - ftVi. ,YT = 

— hH{l-\-\)YT- (11.88a) 

The last equation shows that YT is an eigenfunction of the operators 
and LI This follows from the fact that the operators and 
commute not only with each other, but also with the Hamiltonian H. 
Since the operators and do not commute with it is impos- 
sible to find a wave function that would be a simultaneous eigen- 
function of the operator and the operators or L^. This does 
not mean, however, that the direction of the z axis is a preferred 
direction, since it can be chosen arbitrarily. 

The spherical harmonic can be written in such a manner that it 
will be an eigenfimction of the operators Lj, and L* In this case, 
it will no longer be an eigenfimction of the operator (see Prob- 
lem 12.2). 
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D. ANALYSIS OF THE RESULTS 

Quantuni'-mechanlcal results are generally analyzed either by 
finding their classical analogs or by comparison with the results 
of Bohr^s semi- classical theory, which has a simple physical 
interpretation. To apply Bohr^s theory to the motion of a central 
field, we start from the classical law of conservation of angular 
momentum; we then conclude that the motion takes place in a single 
plane and that the angular momentum vector (which is perpendicular 
to this plane) has the magnitude 

L = = const. (11.89) 

Applying the quantization rules, we find the discrete values that the 
angular momentum can assume 


^ p^d<f = 2itLg = 2TzHn^. 

Hence 

Ll^=nlh\ (11.90) 

where 

/2,= 1, 2, 3 (11.91) 

If the z axis is not perpendicular to the plane of the orbit, then 
the Bohr theory allows us to quantize the projection of the angular 
momentum vector on the z axis. Tnis is known as space quantiza- 
tion. Then 




(11.92) 


where 


n^ = — n^, — n^+I, .... 0, n^—\, n^. (11.93) 

It follows from this that the angle a between the direction of the 
angular momentum L and the z axis is given by the equation 

cosa = ^; (11.94) 

that Is, it can assume only certain discrete values. 

Space quantization is illustrated in Fig. 11.2, which shows that 
— corresponds to the case when L is parallel to the z axis 
(Fig. 11.2a), where n^=—n^ corresponds to the case of anti- 
parallel L (Fig. 11.2b). Finally, for n^=0, the vectors are mutually 
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perpendicular (Fig. 11.2c). It is obvious that space quantization has 
a meaning only when therein some preferred direction in space, for 
example, the direction of the magnetic field Intensity vector."' If 
there is no preferred direction, the orientation of the z axis may 
be taken as perpendicular to the plane of the orbit. 




Fig. 11.2. Space quantization according to the Bohr 
theory (for L - 1; in units of h) 


Now let us compare the quantum-mechanical results with those 
of the Bohr theory for the square and the z components of the 



It is seen that L\m. is zero when / = 0, whereas Lq can never be 
zero. This me ansJhglJbfi^tateJKitlL^= ==Q has no classica l analog. 
It follows that the angular momentum oi an atom in tke lowest 
state is equal to zero, contrary to the results of tne hour tneory, 
tIJ? i}XI]yi4mentai aata from atomic spectroscopy fully confirm 
this quantum- mechanical result. 


is, of course, understood that in the presence of a magnetic field the central 
symmetry is disturbed. 
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In the Bohr theory the direction of the z axis can be taken to 
coincide with that of the orbital angular momentum. In this case 
and, therefore, 


In the wave theory this case corresponds to m = I, when 

= ( 11 * 96 ) 


whereas 


The appearance of the additional orbital angular momentum HH is 
related to the noncommutativity of the angular momentum opera^ 
tors L;,, , and , as a result of which the angular momentum 

components cannot have simultaneous definite values. Therefore, 
when = the components and Ly do not vanish but 

have certain minimum values satisfying the relation 

^qm~ ^«max“t~ (11.98) 

The minimum value of and (AL^)^ may be obtained with the 

help of the uncertainty principle [see (8.13)]: 

*4 I ^y^x\^ ~ 4 ' max ~ T (11.99) 

Since the problem is symmetric with respect to the x and y axes, 
we may set (^^x)miiF=(^^y)min> Hence, we obtain 

(A^Jml„=(Al^>)mln=ft’y, (11.100) 

and the sum of i^^xYmin and (ALy)lnin is exactly equal to the addi- 
tional angular momentum HH. As a result we arrive at Eq. (11.97). 
Thus the nature of this additional term is the same as that of the 
zero-point energy of the harmonic oscillator. Both are related 
to the uncertainty principle. For large values of the orbital angular 
momentum quantum number/, we can neglect the term HH in (11.96) 
in comparison with HH^f so that in fact we have the Bohr semi- 
classical solution. 



Chapter 12 


The Rotator 


Spherical harmonics, which are the eigenfunctions of the square 
of the angular momentum, have their main application in the 
quantum theory of the rotator, that is, in the quantum-mechanical 
description of th e free motion of a point over a sp lmre. The 
results of the theory of the rotator can be used directly in connec- 
tion with the spectra of diatomic molecules. Since, however, the 
angular part of the wave function in a central field is also described 
by spherical harmonics, many predictions from the theory of the 
rotator (for Instance, the angular dependence of the wave function 
<(| and the selection rules for the quantum numbers I and m) remain 
unchanged in the theory of a particle in a central field (for example, 
a particle in a Coulomb field in the problem of the hydrogen atom). 


A. EIGENFUNCTIONS OF THE ROTATOR 

We shall first write the basic results of the quantization of the 
rotator according to Bohr’s theory. These will be used as a 
starting point in our further discussion. 

Suppose a point is moving over a sphere of radius r=a = const. 
Let the origin of the coordinate system be at the center of the 
sphere. The potential energy V (r) is then 

V (/•) = !' (a) = const. 

Since the reference level of the potential energy can be specified 
in any desired manner by defining its value at some point as zero, 
we set 


F(a) = 0. (12.1a) 

The total energy of the rotator is then equal to its kinetic energy 

E = ( 12 . 2 ) 

The generalized momentum which here has the meaning of the 
angular momentum, is found to be 
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Using Bohr’s quantization rule, we obtain 


and, consequently, 


y 

2J • 


(12.4) 


(12.5) 


where J = rriaa^ is the moment of inertia. 

The quantum- mechanical theory of the rotator is a special 
case of the problem of motion of a point in a central field of 
force. Consequently, we shall use Eq. (1.21) to determine the 
radial fimctlon R (/•): 


v'rR ir) + R (r) = 0. 


( 12 . 6 ) 


We have here set the potential energy equal to zero and substituted 
^ = in accordance with Eq, (11,51). Since /' = a = const for 

the rotator, function /?(r)=/?(a) = const, that is, = The 

energy Ei is now easily found from Eq. (11.51) 

p (^ + i) + 1) no 7) 

Comparing this equation with Eq. (12.5), we see that in the Bohr 
theory whereas in quantum mechanics /(/+ 1). As has 

alrea^ been mentioned in Chapter 11 (Section D), this difference is 
due to the noncommutativity of the components of L^, Ly and of 
the angular momentum operator; it is one of the characteristic 
features of quantum mechanics. Both equations become identical 
only for large quantum numbers, that is, when 

According to Eq. (12,7), the energy of the rotator depends 
only on the orbital angular momentum quantum number /. The 
magnetic quantum number m, which characterizes the projection 
of the angular momentum L on the z axis (and, consequently, the 
orientation of the angular momentum in space), does not appear in 
the expression for £■/. The eigenfunctions corresponding to the 
eigenvalue [see (11.67)] do, however, depend on m. Since m can 
vary from — / to [see (11.50)], each energy eigenvalue will 
have 2/+ 1 corresponding mutually orthogonal eigenfunctions; they 
describe the state of the rotator and differ only in the orientation 
of their angular momentum L relative to the z axis. In this case 
the energy level Ejis said to be (2/+ l)-fold degenerate. 

In general, a state of a system (or a given level) is said to be 
A/- fold degenerate if M linearly independent eigenfunctions corre- 
spond to the given energy eigenvalue. 
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The Dhvaical e xplanation for the levels 

of the rotator i s that me rotator forms a centrally symmetric 
system, a^id consequently all directions passing theough the origin 
of coordinates are equivalent. From these considerations it fol- 
lows that degeneracy will occur in any centrally symmetric 
system. 

If there exists a preferred direction, for example, one deter- 
mined by the direction of a magnetic field, the central symmetry is 
disturbed and the possible directions of the angular momentum L 
are no longer equivalent. As a result, the degree of degeneracy is 
either reduced, or the degeneracy can be completely removed. 

In spectroscopic notation the energy levels are called terms; 
for example, the level corresponding to / = 0 is called the s term 
and the level corresponding to /=! , the p term. For the d term, 
/ =2; for the f term, 1 = 3; for the g term, 1 = 4; and so on. 
Correspondingly, the rotator is said to be in the s state when / = 0» 
in the p state when / = 1 , and so on. 

Let us consider in more detail the s and p states of the rotator. 
Since l = m = 0, in the s state, it follows from Eq. (11.67) that the 
eigenfunction 7; corresponding to the energy eigenvalue == 0 is 



and the probability density | KJI* is given by 

(12.9) 

In the p state / = 1, and the quantum number m can have any of 
the three values —I, 0 and -f-'l. Consequently, the energy eigen- 

fi'i 

value £i== is associated with the three eigenfunctions: 


F,-‘ = — e-‘f sin », (12.10) 

( 12 . 11 ) 

F{ = j/'Te'»sin». (12.12) 

The corresponding probability densities are 

|Fr‘l“ = iyj|’=87sin*», (12.13) 

|FJ|’=^cosH. (12.14) 
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The probability distribution fimctions (12.9), (12.13) and (12.14) 
are plotted in Fig. 12.1; they are shown only In the zy plane because 
1 Yf I does not depend on the angle ft. To obtain a complete picture, 
it is necessary to rotate the graph about the z axis. 




Fig. 12.1. Probability density distribution functions for the 
rotator. 

It can be seen from Eq. (12.9) and Fig. 12.1a that, for a rotator 
in the s state, the angle <p , which gives the direction of the angular 
momentum L relative to the z axis, is arbitrary. This was to be 
expected since the angular momentum D = -f 1) is equal to zero 
in this case. A material particle at rest has an equal probability 
of being at any point on the spherical surface of radius a. In other 
words, all positions of the rotator are possible. There is no clas- 
sical analog of this state. 

From Eq. (12.13) and Fig. 12.1b it follows that in the p state, 
when /= 1 and m = dL 1 , the most probable of all the possible orbits 
of the rotator is the one located in the xy plane. The states with 
m=l and m= -1 will have different directions of rotation: for 
ni = 1 the rotator will rotate clockwise (the angular momentum L is 
parallel to the z axis); for m = — 1 it will rotate counterclockwise 
(the angular momentum L is antiparallel to the z axis). When 
/ = 1 and m = 0, the most probable orbit of the rotator lies in a 
plane passing through the z axis [see Eq. (12.14) and Fig. 12.1c]. 
In this case, the orientation of the angular momentum is perpen- 
dicular to the z axis. 

It is worth mentioning that a similar analysis of the angular 
part of the wave function applies to all systems characterized by 
central symmetry. 
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B. SELECTION RULES 

As has already been shown, the selection rules indicating the 
changes of the quantum numbers that correspond to allowed transi- 
tions can be expressed in terms of the matrix elements 

(rC = j‘ (Y7')* rYT dH. (12.15) 


I f the matrix element vanishes for some particular change of the 
qua ntum numbers, the corf^e s ponding transition i forbidden (there 
will^e no radiation). One e^ we know the selection rules, we can 
immediately find both the frequency and the intensity of the radia- 
tion [see Eq. (9,22)], 

Let us replace the coordinates y and z (that is, replace r) in 
Eq. (12.15) by the following new variables: 


2 = a cos 

(12.16) 

? = X iy = a sin V'*’, 

(12.17) 

]==x — iy = a sin 

(12.18) 


From the physical point of view, this is equivalent to resolving 
the motion of the rotator into three parts— an oscillation along the 
z axis described by the z component; a clockwise rotation in the 
xy plane (the c component) ; and a counterclockwise rotation in the 
xy plane (the t] component). In combination, these three components 
completely describe the motion of a point over the surface of a 
sphere. 

In terms of the new variables the determination of the selection 
rules reduces to a calculation of the matrix elements: 


(z)';;"' = 1 {Yf)* cosO yf dii. 

(12.19) 

(5)';^’' = § (^?')* sin Y'l’ dQ, 

(12.20) 

= (| (Yf)* sin Oe Ff' dQ, 

(12.21) 


where, for the sake of simplicity, we have set a= 1, 
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Using the recurrence relations for spherical harmonics* 


cos i» 

sin 


( 12 . 22 ) 

(12.23) 


together with the orthonormality condition (11.68), we find 

{z)im =* const ^m\ m^Z', /± 1 • 
iS)lm = const OT-i-l^Z', Z ihl » 

(Tl)zm const //I— lO/r^ / j- |, 

Therefore, we obtain the following selection rules: 

(a) for vibration along the z axis 


(12.24) 

(12.25) 

(12.26) 


Am = m — m' = 0, A/ = / — /' = zii 1 ; 


(12.27) 


(b) for clockwise rotation 


Am==— 1, A/ = dbl; 


(12.28) 


(c) for counterclockwise rotation 


Am = 4-1, A/=:itl. 


(12.29) 


We have just shown that the only allowed transitions are those 
for which the changes of the magnetic quantum number m and the 
orbital quantum number I are 


rAm = 0. It 1, 

I A/ = itL^ 


(12.30) 

(12.31) 


*The coefficients A and B can be found in a fairly simple way We substitute the 
expansion (11 67) into Eq. (12 22), setting 


pm _ C 

2hul-my t 


(/-m)(Z-m~l) „/-m-2 , 
2(2/-l) 


Then, dividing all the terms by e*'"**’(l — and equating the coefficients of 

and the coefficients of on the left- and right-hand sides (nothing further is obtained 

by equating the coefficients of the remaining powers of x), we find 


-ffl+jwnnroiil B(l m) = J.i lt B 

f (2/H)(2/ + 3) T(2i+1 


1 )( 2 /- 1 ) 


Similarly, we find 


AiO.m) = i JilllAJSm+IIl 

f (2/+l)(2/ + 3) 


Bi(/,m) = + 


(12.22a) 


(12 23a) 


(2/+l)(2/~l) 
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We note that these selection rules for the quantum numbers m and 
I will hold for any centrally symmetric system including, in 
particular, the hydrogen atom. 

From the selection rules, we can find the possible emission 
(or absorption) frequencies of the rotator: 

= (12.32) 

71 

Substituting the expression for the energy E, [see Eq. (12.7)] 
and considering that the moment of inertia of the rotator does not 
change in this case, we can reduce Eq. (12.32) to the form 

= 17 + 1 ) - ^' (^' + 1 )] . (12.33) 

From Eqs. (12.31) and (12.33), we obtain 

= j/. (12.34) 

<«/.hi=-7(H-1). (12.35) 

where the frequency co^ corresponds to a transition from a higher 
energy level to a lower one (a downward transition) and to an 
upward transition. 

C. SPECTRA OF DIATOMIC MOLECULES 

There are three main types of spectra— the continuous spectrum 
of radiation emitted by a heated body (for example, black-botfy 
radiation, with a spectral distribution described by Planck^ s 
formula); line spectra (or atomic 
spectra), caused by transitions of 
atomic electrons between energy 
levels (for example, the Balmer 
series for the hydrogen atom); and 
finally, band spectra character- 
izing molecular radiation. A band 
spectrum consists of bright bands 
with a sharp edge on the low-fre- 
quency side and a diffuse boimdary 
on the high-frequency side. Only 
a high- re solution spectrograph can 
show that the band actually consists of a series of individual 
lines. 

As we shall see below, band spectra are directly related to 
the rotational motion of molecules. 

Let us consider a molecule consisting of two atoms with masses 
tni and separated by a constant distance r (see Fig. 12.2). An 


^2 

G- 


mi 


-o 


Xo-O 


Fig. 12.2. Diagram of a diatomic 
molecule. 
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example of such a molecule is provided, in a first approximation, 
by the diatomic HCl molecule. It is well known that in the case of 
two or more particles, the center of mass moves as a single particle 
whose mass is equal to the sum of the masses of all the particles: 

(12.36) 

The relative motion of the particles is characterized by the reduced 
mass ///red» the reciprocal of which is equal to the sum of the 
reciprocals of the masses of all the particles: 


— =y~ 

wired Y 


(12.37) 


To prove this, let us write the Lagrangian if of a system consisting, 
for example, of two mutually interacting particles with masses mi 
and wij 


X = - K (ATi - jci), (12.38) 

where Xi and are the coordinates of the first and second particles, 
respectively, and Xi — Xi is the distance between them. Introducing 
the relative coordinate x==Xi — Xi and the coordinate of the center 
of mass 


miXi + m2Xa 

c-m.— nii'ynh ' 


(12.39) 


we obtain 


from which, using the Lagrange equation 

dt dXi dxf 

we find that the motion of the center of mass is characterized by 
the total mass 

^.sum^c.m.= const, (12.40) 

and the relative motion of a particle by the reduced mass 
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If the center of mass is at rest (Ab.mf=^). the coordinates a:, and 
are related to the relative coordinate x by the equations 

nhx 

* nh 4* nh * ^ mi nh * 

Accordingly, the moment of inertia of a diatomic molecule is 

J = miX\ + m<iX\ == (12.42) 

which is similar to the expression for the moment of inertia of a 
single material particle whose mass is equal to the reduced mass 
and whose coordinate is equal to the relative coordinate. Therefore, 
in all the results obtained for the rotator, we must substitute Eq. 
(12.42) for the moment of inertia, setting x = a. 

If the radiation is caused only by rotational transitions, it 
follows from Eq. (12.34) that its frequency is 

(I), = 

where 

From this it is seen that rotational spectra (molecular spectra 
that result from transitions between rotational levels) consist of 
sets of equally spaced lines (see 
Fig. 12.3). The rotational spectrum, 
however, lies in the far infrared 
region (radiation wavelengths of the 
order of 100-300 (i) and its investi- 
gation is rather difficult. Absorption 
lines of this type have been discov- 
ered, for example, in the spectra of 
HCl molecules. A measurement of 
the spacing between the lines enables 
us to determinethe moment of inertia 
of the molecule. 

In addition to pure rotational spec- 
tra, there are vibrational- rotational 
spectra, which result from the in- 
ternal vibrations of a molecule in 
conjunction with its rotation. These 
spectra are not situated as far in the infrared as the rotational 
spectra and are more easily studied. 

Let us consider in general form the theory of a diatomic mole- 
cule with varying interatomic distance. This diatomic molecule 
represents an oscillating rotator. Without considering in detail 
the atomic interactions in the molecule, and using only simple 


1^4 


1 

0- 





^32 


CU2t ’^ZoUfo 




’fO 


^10 ^21 ^32 ^43 

Fi«. 12 .3. Spectrum of a rotator. 


(12.43) 

a2.44) 
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qualitative arguments, let us find the general form of the potential 
energy curve V (r ) . 

First, we must set K (/'-►O) -^oo, since the atoms cannot be 
arbitrarily close to one another. Second, the interaction between 
the atoms must become negligibly small as r->oo, and hence 
V/ (r co) 0. Moreover, since a molecule is a stable system, the 
potential energy V must be negative at a certain finite interatomic 
distance r = a^ where it has a minimum. Otherwise, if the 

molecule would rapidly dissociate (see Chapter 27 for a more 
detailed account of the bonding energy of atoms in molecules). The 
general form of the dependence of the potential energy of atoms in 
a molecule on the interatomic distance is illustrated in Fig. 12.4. 


y(r) 



Fig. 12.4. Potential energy diagram for a diatomic 
molecule 

If the departures x — r — aoi the molecule from the equilibrium 
position {r = a) are relatively small (jc<<a), the potential energy 
V (r) can be expanded in a series in the neighborhood of the equi- 
librium point r==a 

V{r)^V{a-\-x)==V (a) J- xV'' (a) + f V" (a) + (1 2.45 ) 

Keeping only the first three terms of this expansion and noting 
that the function V has a minimum at the point r = a [K'(a) — Oand 
Eq. (12.45) can be reduced to the form 


(12.46) 
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Here K"(a)=mred<»* and K(a)a= — D are the elastic constant 
and the dissociation energy of the molecule, respectively.^ 

To find the energy levels of the molecule (and thus its energy 
spectrum), we take the Schrodinger equation (11.21) for the radial 
part of the wave function, since in our approximation the potential 
energy (12,46) possesses a spherical symmetry. 

Since we are interested only in the relative motion of the atoms, 
we replace the mass m by mred in Eq. (11.21). The resulting equa- 
tion is 


v;/? + - V (/•)) - J /? = 0. 

Noting that 

I 2 ^/? 1 d^rm 

^rR = -jp + 7 = 7 —dr~ ’ 


(12.47) 

(12.48) 


and introducing the function 


rR = u, (12.49) 

we obtain after substituting (12.46) into (12.47) 

S + + ^ - - yJ‘-VA “ = (12.49a) 

' red 

Since we may assume in this equation that \ ~ . 

Then setting ^ « 


£ _|_ D — (/ + 1 ) = F, (12.50) 

where ^ = yj reduce Eq. (12.49a) to the form 

«" + (£' - u = 0 (12.51) 

This equation is exactly the same as Eq. (10.14) for the harmonic 
oscillator, and, therefore. 


where k = Qi, 1, 2, 3,... 


(12.52) 


^The dissociation energy {-D) defined as the work required to split the molecule 
into atoms (neglecting the vibrational energy). This energy is usually of the order of 
several electron volts. 
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Thus, when we consider both the vibrational and the rotational 
motion, the energy of the molecule is 

B — D -}- Bhl (/ 1 ) ho) {k Y 2 ) • (12.53) 

The first term here represents the dissociation energy, while the 
second and third terms describe the rotation and vibration of the 
molecule, respectively. 

We note that the molecule has only a finite number of discrete 
energy levels, since it is dissociated when 

Qualitatively, the dissociation of a molecule at large quantum 
numbers can be explained as follows. When k 1, the amplitude of 
vibration may become so large that the atoms will in effect stop 
interacting over this distance and the molecule will cease to exist 
as a bound system. Moreover, if the orbital angular momentum 
numbers / which characterize the rotational energy are too large, 
centrifugal forces can also split the molecule. 

We shall now proceed to investigate the vibrational- rotational 
spectrum. We shall assume that the spectrum is determined 
primarily by the vibrational energy of the oscillations because its 
order of magnitude is larger than the rotational energy 10|j. 

and lOOfi). We must bear in mind that spontaneous transitions 

can occur only from higher to lower energy levels, that is, with a 
change from k to k — 1 [in accordance with the selection rules, the 
quantum number / may change to either lower (/ ->/ — 1) or higher 
1 ) values]. Then for the frequency of the radiation 

-1, lJt\ ) 


we obtain from (12.53) 


(12.54) 


Here, in accordance with (12.34) and (12.35), we have 
+ 2fi(/+ 1), and 


0) 



Thus, two branches are obtained (see Fig. 12.5): 


= ‘“vib + and «> ' = — 2B (/ + I ) 


(12.55) 
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Such vibrational- rotational spectra are observed, for instance, 
in HCl and CO molecules, and the investigation of vibrational- 
rotational spectra is of great importance for the study of molecular 
structure. From these spectra it is possible to determine various 
properties of molecules, such as their moments of inertia and the 
isotopic composition (since the moments of inertia of two molecules 
consisting of different isotopes of the same element will be some- 
what different). 

Negative branch Positive branch 



Fig. 12.5. Vibrational-rotational spectrum of a 
diatomic molecule 

To conclude this section, let us consider the spectrum of a 
molecule when one of the atoms is in an excited state. In this case 
the vibrational- rotational radiation is accompanied by the transi- 
tion of one of the atomic electrons from an energy level n to a 
lower level n\ 

The energy of such a molecule can be written in the form 

EM^En-\-E, + E,, (12.56) 

where E^ is the energy of the excited atom. For hydrogen, E^ is 
given by the Balmer formula (see Chapter 2) 

(12.57) 

The energies of the vibrational and rotational motions are, respec- 
tively. 


and 




E, = Bhl {I + 1). 

As a result of the transition, the energy Eav 
changes and becomes equal to 

Em' = Efi’ "f" -f" Ep . 


(12.58) 

(12.59) 
of the molecule 


(12.56a) 
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Since the mainpartof the radiant energy is now due to the electronic 
transition n n' in the atom, the quantum numbers k and / may 
either increase or decrease 

k' = k±U V = ld.\. (12.60) 

Regardless of the changes in k and /, the overall change in energy 
is a decrease because of the emission of radiation due to the 
electronic transition. 

This case is characterized by a further important feature, 
namely, the strong dependence of the energy bonding the atoms in 
a molecule on the number of the particular shell in which the elec- 
tron is located. Therefore, a transition causes a change in the 
bonding energy, which in turn leads to a change in the interatomic 
distance a. In transitions from an excited to the ground stated the 
distance generally increases, together with the moment of inertia 
y =mj.e^a* whereas the quantity B = ^ decreases. Owing to the 

change there is an additional slight change in the rotational 

part of the energy, which now becomes 

Ev = BhV(V-\-\). (12.59a) 

where R'<R in the present case. 

Taking into account all possible vibrational and rotational 

yu' 

transitions, we find that the frequency of radiation = - is 

where 

= (/-f-l) — Br(/'+ 1). 

With the notation m, = — ±(o, we can write Eq. (12.61) as 

«>.M = «>o + (12.61a) 

Finally, we obtain three frequency branches for the molecular band 
spectra 


(12.61) 

(12.62) 


0>''' = «)o + “/,«- If 
a)' = u)o u),_ [ 
(i)"= (Oa u),, 


(12.63) 

(12.64) 

(12.65) 


In these formulas the first or positive branch (the R branch) 
corresponds to downward transitions between the rotational levels. 
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and the second or negative branch (the P branch) to upward transi- 
tions. The thirdbranch (called the zero branch or Q branch) appears 
when there are no transitions between rotational levels; it is due 
entirely to changes in the moment of inertia caused by trsmsltions 
within the atom. 

Using Eq. (12.62) we can represent u)‘ and m" in the form 


= — + (12.63a) 

m- = <d,-{-(B ~ B'){1 + If -(B + B') (/ + 1 ), (12.64a) 

o)« = ( 0 , + (B - B') (/' + /). (12.65a) 


These branches are depicted in Fig. 12.6, where the frequency is 
plotted along the abscissa and the orbital angular momentum 
quantum number I along the ordinate. It can be seen from this 



Fig. 12.6. Molecular band spectra: — the 

positive branch; oT — the negative branch; 

— the zero branch. 

diagram that the superposition of the rotational lines (»,, on the 
electronic-vibrational line id, gives rise to a whole band of lines 
with a sharp edge on the left and a diffuse edge on the right. This is 
in complete agreement with the experimental facts. 
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Problem 12. ly Find the explicit form of the spherical harmonics for the cases vdien 
the broitai angular-momentum quantum number is / = 2 and / = 3. Verify their ortho- 
normality by direct calculation. Plot a graph of | |* for /;? = 0, ± 1, ± 2. 

Answer, 


Y}' = ±y ~ sin » cos 

Kjt ' = ± ^ j/" sin 9 (5 cos* a - 1) , 

Kf’=.-ly^-^fsin*6cos9^±='% 

sin * de-t 


is 



Proble m 12.2.1 Find the eigenfunction of the operator given that its eigenvalue 
the orbital angular momentum quantum number is / = 1. 

Solution, Let us look for a solution of the equation for / = 1 and 




in the form 


^=CtKi + C-.Krt + CoK;. 

Using (1L88), with / = 1 and 0, and applying the normalization condition 


we find 



sin SJOrfcp = 1, 




(Ki-rr‘)= 


i/'A 
y 4n 


- sin 8 cos'f. 


grobl^ 
coor3lftflfes. 

Solution, 
written as 


Investigate the general form of the motion of a free particle in spherical 
Determine the normalized functions for / = 0(s states). 

The Schrodinger equation for a free particle in spherical coordinates is 




(12.66) 


where 


tfi __ 2Wofr 


3 


The problems in this chapter apply also to Chapter 11. 
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Introducing the new function X= YT we transform Eq. (12,79) to 


X"+ j-X' + 



x=o. 


The solution of this equation is a Bessel function of half-integral order. Since the 
wave function must remain finite as r — 0, we retain only the Bessel function of the first 
kind. Then 


^ const 

Ykr ( + 


(kr). 


and for a free particle with a given energy the general solution of the wave equation in 
spherical coordinates can be written as 


00 < . 


IwmQ m — I 


(12.67) 


In particular, fori = 0 (s state), we have 


R(k,r)=^%\T\kr. ( 12 . 68 ) 

Since the spectrum is continuous, this expression must be normalized by a & function 


00 

f /?♦ r) R (k, r) r^dr = » (A — k% 
d 

Hence, we find; C = 


Find the selection rules for quadrupole radiation from a rotator, 
necessary to find the transitions for which the matrix element of the 

quadupole moment 




Qss' — ^ J 'H'm' 


does not vanish. Here 


5, s' = I, 2, 3; Xi=ez = a cos 
Xi^s ==x ±ty — a sin be— | 

Since the matrix element of a bilinear combination can be expressed as the sum of the 
products of linear matrix elements (see Problem 10,4), 

(Vs'C'= 2 

m", r 

and, moreover, it follows from Eqs. (12,24), (12,25) and (12,26) that the only nonvanishing 
linear matrix elements are of the form 

►/ 1. m 4* 1 ± 1. 'w — 1 J '±i> fn 

V. m * \ OT * ^l, m * 

we obtain the following selection rules for the quadrupole radiation from a rotator; 
i/n = 0, i 1, ±2; A/=0, ±2. 
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As we shall see later, the parity of a spherical harmonic is determined only by the 
value of I and is independent of m (see Chapter 17). Consequently, for dipole radiation, 
the only allowed transitions are from an odd state to an even state or vice versa (A/ = ±1), 
whereas for quadrupole radiation, the transitions can occur only between even states 
or between odd states. 


Problem 12.5.) Expand the plane wave 


Jkr cos 


(12.69) 


in terms of spherical waves. 

Solution, Introducing the notation kr ~y and cos 0 = r, let us look for a solution in the 
form of an expansion in Legendre polynomials 

CO 

ely^=:yff,(y)P,(x). 

/ = 0 

Using the orthonormality condition for Legendre polynomials [see (11.68)], we find 

I 

Substituting the expression for the Lengendre polynomials as given by Eq. (11.59) and 

transferring the derivative from the function [jl to the function times (that 

is, integrating by parts / times), we obtain 


I 


Using the well-known relation from the theory of E3essel functions^ 


J (1 -x'Yeh’^dx^ Y^.l\ ( 


we determine the coefficients Bi, The desired expansion of a plane wave in terms of the 
spherical waves is now written as 


+ P,(cos»). (12.70) 


Let us emphasize that the plane wave satisfies the Schrodinger equation for a free 
particle 


and, therefore, the right-hand side of Eq. (12.83) also represents a linear combination 
of particular solutions of the above equation written in spherical coordinates. ITiis 
explains the expected proportionality between the coefficients Bi and the Bessel functions 

*^1 + V* 


^See, for example, P. M. Morse and H. Feshbach, Methods of Theoretical Physics, 
part I, p. 572, McGraw-Hill Book Co. 



Chapter 13 


The Theory of the Hydrogen-like Atom 
(Kepler's Problem) 


The Bohr theory of the hydrogen-like atom (see Chapter 3) is 
semiclassical in nature and can only provide a very incomplete 
explanation of some of the basic properties of the atom. It does not 
enable us to calculate the intensity of radiation emitted by an atom 
or to construct a theory of an atom with more than one electron. 
The wave-mechanical theory of the atom is able to deal with these 
problems without any fundamental difficulties. The problem of the 
hydrogen-like atom presents in addition a certain methodological 
interest since it can be solved exactly, like the harmonic oscillator 
and rotator problems. Mathematically this problem can be regarded 
as a generalization of the classical problem of the motion of a 
planet around the sun (Kepler’s problem). 


A. ENERGY EIGENFUNCTIONS AND EIGENVALUES 


The energy of interaction between an electron and a nucleus 


V 



(13.1) 


depends only on the distance r between them. The problem of the 
hydrogen-like atom— a single electron moving about a nucleus— Is, 
therefore, a typical example of motion In a central field of force. 
If the origin of the coordinate system is at the center of the nucleus, 
we may regard the angular part Yf of the wave function as known 
[see Eq. (11.67)] and find the energy levels and the radial part 
R{r) from Eq. (11.21), which for the present case is written 
as 


V'rR + ^^{E-\ 


r 


2/nor* 



0 . 


(13.2) 
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Let us introduce the effective potential energy of the electron' 

Ke,, — f + (13.3) 

where the first term represents the Coulomb interaction and the 
second the centrifugal forces. 



Fig. 13.1. The effective potential energy (solid line) as a 
function of distance 



The dashed line shows the behavior of the wave function 


A graph of Veu is given in Fig. 13.1. This graph shows that if 
the total energy of an electron is negative (ZTc^O), it moves within 

^We shall attempt to interpret Eq. (13.3) from the standpoint of the classical theory, 
using the classical relationship 



(13 3a) 
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a region bounded on both sides by potential barriers (the case of 
elliptic orbits in the classical analog), so that the energy spec- 
trum will be discrete. 

For B > 0, there is no barrier on the right (r -* co) and the posi- 
tion of an electron can range to infinitely large R (the case of 
h 3 q)erbolic orbits in the classical analog). Since the electron’s 
position in the atom must be bounded by a certain Amax (elliptic 
orbits), it is necessary to assume that E<^0 in order to develop 
a theory of the atom. Accordingly, Eq. (13.2) becomes 


d-R , 2 dR 

rfr* ‘ r dr 



y'+-!ijR=o, 


where 


= B > 0 and — =Ayo. 


(13.4) 


(13.5) 


Introducing the new variable 


P = 2) Ar, (13.6) 

we obtain the equation 

R" -f jR'-\-{-j + -^-pt T R^o, (13.7) 

where R' = (dRId^j ) . 

From the graph of Vefu see the general nature of the 

solution. Clearly, inside the well (''min<''<C'’max ) the solution will 
be oscillatory, whereas outside the well (/---^O and r—*oo) 
there are two solutions (on each side) : one that tends to zero and 
a second that tends to infinity. To make the solution an acceptable 
wave function, we must impose on it restrictions that will elimi- 
nate the solution that tends to infinity. Just as in the problem of 
the harmonic oscillator, it turns out that this requirement can be 
satisfied only for certain discrete values of the energy of the 
electron. 

Since the potential well is not symmetric, we shall look for 
asymptotic solutions for p — 0 and for p -co separately. The 
asymptotic solution for p — oo may be found, according to Eq. (13.7), 
from 

(13.8) 


which gives 


Roo == -j- 


( 13 . 9 ) 
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To eliminate the exponentially increasing solution, we must 
set Cj== 0 . Coefficient Ci can be included in the normalization 
coefficient of the wave equation and can, therefore, be set equal to 
unity. We then have 


= (13.10) 

The asymptotic solution for p — 0 can be obtained from Eq. 
(13.7), which in this case reduces to the form 


r:+jK--^^Ro=o. 

(13.11) 

Setting /?o = p'', we find <7 (<7 + 1 ) — ^ (^ + 0 = 0, and 
= — (/ + 1 ). Consequently, 

thus qi — l, 

^. = C,p' + C,p'-'. 

a3.12) 

Setting C 2==0 (to exclude the solution increasing to infinity at 
p = 0 ) and Cl == 1 , we obtain 


(13.13) 

Equation (13.7) can be rewritten as 


d’fR 1 [ 1 1 /(/+') 

dp* +1 4 1 p p* 

(13.7a) 

and we assume its general solution to be of the form 


R==RooRiiU. 

(13.14) 

In this case 


II 

+ 

II 

(13.15) 

and for an unknown function ii we obtain the equation 


' V ^ \ V 4 ‘ p Y j 

(13.16) 


According to (13.15), 


In 0 = — V«P + + I ) In Pi 


and, therefore. 


jy' 

V 


= (lni;)' — — -f- 


(+J 

P 


that is, t;' = ^ ^ 
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Furthermore, we have 


.L+lv^(-l4.L±l\\ 


Using these formulas, we transform (13.16) to 

P«" + [2(/ + l)-pl«' + [-^-/-l]« = 0, (13.17) 

If the behavior of the solution for R is to be determined at the 
origin and at infinity, by the asymptotic formulas (13.10) and 
(13.13), we must find the conditions under which the function u 
will be a finite polynomial of degree k without negative powers: 

k 

« = 2]a.p’. (13.18) 


Substituting (13.18) into (13.17) and collecting powers of p,we 
have 


k 



+ a..i (v (V + 1 ) + 2 (V -f 1 ) (/ + 1 )] } = 0. (1 3.19) 

Since a*^.i = 0 and a* 96 0 , we obtain 

+ (13.20) 

Here the quantum number n, which is equal to the sum of the 
orbital angular momentum quantum number 

/ = 0 , 1 , 2 , 3 ,... 

and the radial quantum number 

*= 0 , 1 , 2 , 3,... (13.21) 

plus imity, is called the principal quantum number n. It may assume 
the values 


n=l,2, 3 (13.22) 

To determine the imknown coefficients a. of the series (13.18). we 
derive a recurrence relation with the aid of Eqs. (13.19) and (13.20); 


a, {k — v) = — a,+i (v-)- l)(v-|-2/ 4*2). 


(13.23) 
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Setting the coefficient of the highest power in (13.18) ak= (— l)'' and 
calculating all the remaining coefficients with the help of (13.23), 
we obtain the following expression for the function u: 




I — 1) (/l -t~ s) (^ + S — 1) \ 

1! P "T 2! ^ •••]”■ 

k 


= 2 <- 


1 ^k-J 


2! 

k\ (k + s)\ 




p ' yi (A • 


(13.24) 


where s=2/+l. The series (13.24) is an associated Laguerre 
polynomial Ql(p) of order k and may also be represented in closed 
form^ 


w = Q* (p) = ePp-" ^ (e-fp'+O. (13.25) 

Thus, for the radial function /?„,(/■) we finally obtain 

R.i (P) = W QS'+'_, (P). (13.26) 

where p = 2y^Jr. Recalling that -~ = n, and substituting the 
value of B from Eq. (13.5), we find 


P== 



(13.27) 


where ao = — y is the radius of the first Bohr orbit. Calculating 

the coefficient C^i from the normalization condition (see below), 
we obtain 

V~n (n - 1 — D'lln + or ' (13.28) 


^We shall show here that the function ii, written in the closed form (13 25), satisfies 
Eq. (13 17) The function v -= satisfies the equation pv' + (/?— k - s)v - 0, as can 

be easily verified by taking the first derivative of u with respect to p Differentiating this 
equation k ^ I times according to Leibnitz's rule, we reduce it to the form 

» (P - s + l)u^*'*’'^ f (fe + 1)1’^*^ ^ 0 

Introducing the new function we obtain the differential equation 

pw " t (s ♦ 1 - p)w* f kw - 0 , 

which is identical with Eq (13 17) for the function u [since I - I = k]. Since it 

can be readily shown that the coefficient of the leading term (p^) of the function 

dp" 

IS identical with the corresponding coefficient of (13 24), we have demonstrated that the 
relation (13.25) is correct 
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Therefore, the radial wave function is 


Rnl 



4 

n(;i — / — l)t (n + Z)! 



(13.28a) 


As we know^ the normalization condition for the radial part of the wave function is 


J dr == 1. 

Substituting the expression for from (13,26) and replacing r by p, in accord- 
ance with (13.27), we obtain 


0 

Now let us represent one of the polynomials 0?^* as a series (13.24), leaving the other 
polynomial In the closed form (13.25ju The normalization condition then becomes 


(i^J J P (- ')M P*- ft (* + 2/ + I) p*- +...) (e-Pp»+''+') rfp = 1. 

Applying the theorem for the transfer of a derivative (see Chapter 7, Section C) to this 
equation, we find 





e-P [(* + I)! pP'+^+P — ft! ft (2/ + ft + 1) p''^''+‘] df = I. 


It is obvious that the remaining terms in the series representing the function Ql vanish, 
since the order of the successive derivatives is higher than the corresponding powers 
of p. Using the well-known integral 


f ^-Pp^fifpzrr S!, (13.29) 

0 

we obtain the expression (13.28) for , In a similar fashion, we can determine the 
average value of (r~'') 1, d, 4>, which will be useful in the later development: 

oo 

(0== J — ^ dr. 

On the basis of the above equations we can rewrite this as 


(r -l = c;, f'y )' ("J)” jj r'*' (- »• If* - * (» + 21 + 1) »*-' + 
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Setting V 2, 3 and 4 in the last expression, and using once again the theorem for 

the transfer of a derivative, we obtain after a few simple operations 


'/.)(/+ !)• 

f7Tj_J. (lY 3«‘" <(/+ ’) 

In calculating we retain only the leading term p* in the polynomial Q^, On the 

contrary. In calculating r~*, we retain only the last term po. For r*’, we retain the last 
two terms, and so on. The expressions for r~* and r * are obtained under the assump- 
tion that^ Por the s states (/ = 0), interactions which are usually proportional to 

either or r~* are generally replaced by a contact interaction (see below. Chapters 19 
and 20). 

To supplement this general treatment, we shall calculate the 
normalization coefficient of the wave function for the 
ground (or lowest) energy state, which is characterized by the 
quantum numbers 


n = 1 , = 

According to Eq, (12,9), the spherical harmonic EJ”, Is a constant 
for / = m==0, as is the polynomial for ^ = 0. Consequently, 

from (13,26) we have for «;>ioo 

Zr 

<J.,oo = Ce (13.30) 


and, therefore, the normalization condition takes the form 

^ (13,31) 

Substituting the explicit form of the wave function '^loo and bearing 
in mind its independence of the angles d and 9, we obtain 

00 

CMicJ e '■“VVr = l. (13.31a) 

0 

Introducing the variable a'== -^ and using (13.29), we find 


The same value for C can also be obtained from the general equa- 
tion (13.28) by setting n = 1 and / = 0. Therefore, the wave function 
for the ground state is 
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❖ioo = 



rto 


(13.32) 


The energy spectrum of the hydrogen- like atom is found from 
Eqs. (13.20) and (13.5): 


2aon^ /i* 


(13.33) 


where R is the Rydberg constant 


o elmo 

2 / 1 * 


This expression for the energy, which we note is in complete 
agreement with the corresponding expression of the Bohr theory 
[see Eq. (2.50)], depends only on the principal quantum number 
n = (that is, only on the sum of the orbital and radial 

quantum numbers I and k) and is independent of the magnetic 
quantum number m. At the same time, the wave function nlm — RfiiY^ 
depends on all three quantum numbers tiy I and ni individually. 
Consequently, it follows from the Schrbdinger wave theory that the 
energy levels are degenerate. Since each value of / can vary from 
0 to /z — 1 and tn can vary from —I to -\-l [see (13.20)], the degree 
of degeneracy is 


I 


n— 1 


2 2|=2(2<+I): 


1=0 ni^—l 


Z=0 


It was shown in Chapter 12 that the degeneracy with respect to 
m is characteristic of all central fields of force and is related to 
the fact that there is no preferred direction passing through the 
origin. The degeneracy with respect to the orbital angular mo- 
mentum quantum number I appears, however, only in the case of 
pure Coulomb interaction. In most other centrally symmetric 
systems there is no I degeneracy; that is, the energy level for a 
given value of n is split into n sublevels corresponding to different 
/.^ If the system is placed in an external field (for example, a 
magnetic field) which removes the central symmetry, the de- 
generacy with respect to m also disappears. In this case, the ntYi 
energy level is split into distinct sublevels. 


^In particular, as we shall see later, the degeneracy with respect to / vanishes even in 
the case of the hydrogen atom if we take into account the relativistic effects, the nuclear 
volume, and the so-called vacuum corrections Similarly, in the spectrum of alkali metals, 
which have one valence electron in the outer shell, the influence of the electrons in the 
inner shell removes the degeneracy with respect to / 
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B. SEMICLASSICAL INTERPRETATION OF THE 
PRINCIPAL RESULTS OF THE QUANTUM-MECHANICAL 
THEORY OF THE HYDROGEN-LIKE ATOM 


In classical theory the quantity 




2mar'^ 


(13.34) 


[see (13.3a)] must be greater than zero. For elliptic orbits 
(£=:-— I i:|<;;0) it is seen that this is possible only when the radius 
r lies within certain limits (^min^''^''max) which can be found by 
setting the right-hand side of (13.34) equal to zero. Using the 
quantum-mechanical expression for the energy (13.33), we find 


Tmax 

min 





\ 


/ 


The equation of an ellipse in polar coordinates is 

r= P- 

1 + e COS cp * 


(13.35) 


(13.36) 


where the parameter P — ~^ is defined as the ratio of the square of 
the semiminor axis b to the semimajor axis a, and the eccentricity 

T/’ ^2 _ Ij 2 

£ = — characterizes the elongation of the ellipse (for s = 0, 

the ellipse becomes a circle). From (13.36) we readily obtain 
equations for 'min: 


'max — ^ (1 + e)» 
'■nun=«(< -0- 

Comparing Eqs. (13.37) with (13.35), we find 



(13.37) 


(13.38) 




e. 


(13.39) 


ti^n 

We see that the classical analog of the quantity turns out to 

be the semimajor axis of an ellipse whose eccentricity is given 
by Eq. (13.39), 
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Substituting into (13.39) the Bohr value of pl — \y 

and then the quantum-mechanical value of = 1), we obtain 


eB= l/l- (13.40) 


From this it is clear that the eccentricity becomes exactly zero 
only in the Bohr theory (/ = /z — 1) in wave mechanics, for l = n — 1 
the eccentricity has a minimum nonzero value given by 


quan r ti 


(13.42) 


This shows that in quantum mechanics, we can speak of the clas- 
sical analog of states with circular orbits only when the quantum 
number / has the value n — 1 Furthermore, it should be noted that 
in the classical approximation, an .s state (1 = 0) gives 1, 

which corresponds to a parabolic orbit. This case, however, cannot 
be associated with a parabolic orbit from the quantum-mechanical 
point of view. As we can see from Eq.( 13. 33), in quantum mechanics 
the energy is negative for all values of n (we recall that in the 
classical case the total energy is zero for parabolic orbits); con- 
sequently, the radius for / = 0 is limited only by its maximum 
value 

r 

'max I /: j * 

This lack of agreement between the quantum- mechanical and 
classical solutions for the 5 state simply means that the case /==0 
has no classical analog. 

In general, we are, of course, entitled to speak only of the 
probability of particular events in the context of wave mechanics. 
Therefore, all the results obtained from the Schrbdinger theory 
must be interpreted in terms of probability considerations. Let us 

show, for instance, that the Bohr radius a = [see Eq. (13.38)] 

for the case of circular orbits (t==^n — 1) corresponds in wave 
mechanics to the most probable value of the position coordinate 
of the electron, r = a. 

According to the normalization condition 

oo 

f r^Rl,dr=\, 

a 

the distribution of radial probability density D (r) is 

D{r) = r'R‘„,. 


(13.43) 
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In the case of circular orbits, when l = n— 1 and/j = 0, Eq. (13.43) 
gives 

D(r)=rmin^i^ (13.43a) 

According to Eqs. (13.25) and (13.26), 

1 

- -s- P 

= const 0 p"-*, 

and, therefore, we find the following expression for D{r) (see Fig. 
13.2): 

D (r) = const • (13.44) 

Determining the value of r at which this function has a maximum: 


we obtain 



(13.45) 


It is interesting to note that if we set Z= 1, the most probable 
radius r is the radius ao of the first Bohr orbit. 




r^a r 

Fig. 13.2. Radial probability density distribution function 
in the case of circular orbits. 

If the radial quantum number k ^ 0, the orbits can be said to be 
elliptical and the probability distribution D[r) assumes the form 

D (r) (13.46) 



The equation for the extremals of this function is 


= const (21 + 2) Q«+' — 

- PQI‘+' + 2p I Qf+'} Ql‘+' = 0. (13.47) 
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Since is a polynomial of the kth degree, Eq. (13.47) has k 

roots (not counting points /’ = 0 and r = oo) and 1 maxima. This 
case is very similar to the probability distribution for the motion 
of a free particle Inside a potential well (sinusoidal variation of 
the wave function, see Chapter 4) or for the motion of an oscillator 
(see Chapter 10, Fig. 10,2). 


C. SELECTION RULES, EMISSION SPECTRA OF 
HYDROGEN-LIKE ATOMS 

To determine the selection rules for hydrogen-like atoms 
(Kepler’s problem), it is necessary to calculate the matrix ele- 
ments 


Substituting here obtain 


(13.48) 


00 

f (yfr . (13.48a) 

As we know from Eqs. (12.24), (12,25) and (12.26), integration 
over the angles ^ and <p gives the selection rules for the orbital 
angular momentum quantum numbers (M = l — l' = ztl) and the 
magnetic quantum numbers (Am = m — = ztl). Using these 
results we can write Eq, (13.48a) as 

00 

= const ( *'»'■'» \ 8,.. ,i, r R,,, r'R„,dr. (13.48b) 

I OT+l ; J 

0 

Evaluating the integral"* 

00 00 ^ ^ 

I r’R,,K„4, ~ f X 


^Thls integral can be evaluated by introducing the new variable /?= — ( — + Then» 
I I ±i 

expressing and Q^/ in the forni of polynomials, we can perform the integration term by 
term. It is found that only the integral 

= 0 (nVn) 

vanishes because of the orthogonality condition. 


1 
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it is easily shown that it does not vanish for any value of n'\ that 
is, for all allowed transitions the principal quantum number can 
change arbitrarily, 

£,ev 



Fig. 13.3. Spectral senes of the hydrogen 
atom 

The wavelengths corresponding to the indi- 
cated transitions are expressed in Angstrom 
units 

Having obtained the selection rules for the hydrogen-like atom, 
let us investigate its emission spectrum. We shall first introduce 
certain conventional symbols designating the energy levels of an 
atom. The spectral terms (— E^ilfi) which depend, in the general 
case, on both n and I are denoted by the symbol («/), that is, 

(-• = (13.49) 

where n=l, 2, 3, ..., and I is replaced by one of the letter symbols 
s, P, d, /, gy h, . . corresponding to 1 = 0, 1, 2, 3, 4, 5, ..., as 
indicated in Chapter 12. Since the quantum number the 
only possible terms are I 5 ; 25, 2py 3s, 3d; 4s, 4/?, Ad, 4/; 5s, 5;?, 

5d, 5/, 5g; and so on. 

There cannot, for instance, be a Ip term, since in this case we 
would have n= 1 and / = 1, nor can there be a 3/ term, since this 
would give us n = l = 3. The radiation frequencies expressed in 
term symbols (nl) have the form 



THE THEORY OF THE HYDROGEN- LIKE ATOM 


217 


. (13.50) 

where it is necessary to bear in mind the selection rules for the 
orbital angular momentum quantum number /, namely, = 

Using Eq. (13.33), the term (nl) can also be represented in the 
form 


RZ^ 

2/t« • 


(13.51) 


where R is the Rydberg constant , We thus obtain the 

following equation for the radiation frequency ^nn ' : 

= (13.52) 

In the case of the hydrogen atom(Z = l), the Lyman series (see 
Fig. 13.3), which corresponds to a transition to the lowest energy 
level J (the Is level), is given by 


S) - (lip) == /? (-1 _ . (13.53) 


where « = 2, 3, 4, .... For the Balmer series (see Fig. 13.4), 
which corresponds to a transition to the level «' = 2from the levels 
//>2, there are three types of 


allowed frequencies: 



u 





g 

^ 5 


“>Balmer=(2s) — (np); 

< 

8 


S 




“Bataer=(2rt — («S); 










“Baln»r=(2/')-(«^i)- 

(13.54) ■ 
! 


/ 


/ 

y, / 



1' 


Since the energy states of the hydro- 
gen atom are degenerate with respect 
to the orbital quantum number, these 
three lines merge into one (see 
Fig. 13.3), and, consequently, 

r(iV- 4.)- 


Fig. 13.4. Balmer series. 

The wavelengths corresponding to 
the visible lines Ha, H^, and 
H s are given in S Angstroms (A). 
gives the theoretical position of the 
limit of the senes. 


A similar result is obtained for the Paschen series, whose fre- 
quencies (see Fig. 13.3) are given by 


(U 


Paschen 




(13.56 


where n = 4, 5, 6 . . . . 
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D. MOTION OF A PARTICLE IN A COULOMB FIELD 
IN THE CASE OF A CONTINUOUS SPECTRUM 

Although the wave function for a continuous spectrum can be 
e 3 Q)ressed in terms of a confluent hypergeometric function, the 
usual procedure adopted in the investigation of the hyperbolic 
solution is to consider the asymptotic behavior of the hypergeo- 
metric function for large values of r These asymptotic solutions 
can be obtained directly with the help of the quasi-classical WKB 
method. To investigate the hyperbolic orbits (E > 0), let us first 
use this approximation to obtain certain general results applicable 
to the case of central forces. According to Eqs. (11.21) and (11.51), 
the equation for the radial part of the wave function in the case of 
central forces is 

+ = (13.57) 


where 


«==/?r. (13.58) 

In the WKB method, the potential energy 1/ (r)is required to decrease 
more slowly than as r-^0. Furthermore, we are entitled to 
choose the potential energy in such a way that 

l/(/' = oo) = 0. 

Equation (13,57) has a singularity at the point r 0 associated 
with the term /(/+ l)r~^ The effective potential energy here forms 
a potential barrier of infinite height. Consequently, the asymptotic 
forms of the wave function which we obtained earlier for con- 
necting the WKB solutions across a slowly varying potential 
barrier do not provide a good approximation in the present case. 
Therefore, in order to use the WKB method in the case of central 
forces, we must obtain either a different asymptotic expression 
in place of the Bessel functions of order V 3 , or we must remove the 
singularity from the point /- — O to the point a:== — 00 by introducing 
the new variable A; = In/', 

We shall use the second of these methods and introduce a new 
wave function x (^) : 


(13.59) 

Equation (13.57) then takes the form 

£- ^ V(e^) - (/+ X=0. (13.60) 

to which the WKB method is applicable. For the argument z which 
determines the asymptotic behavior of ij) [see Eq. (5.55)], we obtain 
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z = — 2/«oV' (e*) - A' (/ + il’e-** | dx = 

= ]il\p\dx. (13.61) 

It should be noted that here |p| no longer represents the momentum. 

Transforming back to the original independent variable r, we 
obtain 


z= * 


ij/' 


2tn^E — 2mQV (r) — H 


hi) 


dr. 


(13.62) 


From this it follows, in particular, that in using the quasi-classical 
e5q)ression for the one- dimensional radial equation, we must make 
the following change in the orbital angular momentum: 




(13.63) 


Let us now apply Eq. (13.61) to the motion of a particle in a 
Coulomb field 




r 


for the case of hyperbolic orbits (Z:>0). In the quasi-classical 
treatment of the problem, it is necessary to use the connected 
asymptotic solutions (5,66)— (5.69). This means that in our case 
[see (13.62)J we must evaluate the integral 


J 2 2c-\-br — 2Ycf~ 

I ^ 2ar b ■]-2 Y ®/_ 

"^4 /a 2ar + b — 2Y^' (13.64) 

where 

f=aY-{-l>rhc, (13.65) 

and the value of is found from the condition /(r„)=0. 

First let us determine the asymptotic solution of the wave 
function for ro>r-»-0. Substituting 



2moZe'^ 2moB 

ft* . -fts 


into Eq. (13.64) and introducing the notation 


. m^Zel 2eJ 

«— a . T— — fiv • 


(13.66) 
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we obtain 



On the basis of Eq. (5,51), the asymptotic solution for this case 
has the form 

where 

in accordance with (13.61). Since, according to (13.58) and (13.59), 
the radial wave function R is related to x by the equation 

R^-^-X, ( 13 . 67 ) 

we find 


In order for the solution to remain finite at the origin, we must 
set D =0; that is, we choose the solution in the form 

/? = Cr', (13.68) 

which is in complete agreement with the asymptotic solution 
obtained by another method [see (13,13)], 

For the other limiting case r->^oo, we have the following 
asymptotic solution [see (5.67) and (5.66)]: 

= + (13.69) 

where, according to (13.61), p is equal to 

P = rhk. (13.70) 

Equation (13.69) is an analytic extension of the asymptotic solution 
for r-^0. To determine the quantity z from (13.62), we use the 
integral (13.64) with the following substitutions: 
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2moE 2m(iZel 

a— , 

— ('+ir- 

Then, since ]// ^ + T* we obtain at r oo 




We shall also make use of the relations 


2ftr 


(13.71) 


•. (13.72) 


_ln^/+l)) = 

= ln')/T* + (/ + -j)^ +i|y— arc tan-^j, (13.73) 

ln(T-» (^ + Y)) = 

= In i/'t* + (/ + !)’ +/|^+ arc tan (13.73a) 

Here, in writing the imaginary part, we took account of the fact 

that if>0 and + and therefore the angle determining the 

imaginary part of the logarithm in Eq. (13.73) lies in the first 
quadrant, and the angle of the logarithm in Eq. (13.73a) in the 
fourth quadrant. 

The expression for the argument z may now be transformed to 


z = /jf -j- Y • n 2/?/' — Y / — ^ + B?. 
where the phase B/ is^ 


(13.74) 


B? = — (/4--^jarc tan-^-y--7ln]/^(/ + yj*+T® +T- (13.74a) 


^From a more accurate calculation (a calculation of the asymptotic expression for the 
argument z from an expansion of the confluent hypergeometnc function), the following 
value IS obtained for the phase (see P M Morse and H. Feshbach, Methods of Theo- 
retical Physics, Part II, New York: McGraw-Hill, 1953): 

^ = -argr(/ + 1 + ir) 

If |/ -f iy\ ^1, then using Stirling’s formula 

r(l t I y ly} ~ |r(l + I f iy)\e == ^ ) * 

we obtain for the approximation (13 74a) S® 
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In particular, when 1 = 0 and 1, we may write as an approxima- 
tion 

5S==Y(l-"lnY)-|. 

Hence, taking into account Eqs. (13.69), (13.67) and (13.70), we 
find 

C sin (kr — 

R,(k)= (13.75) 

Here C is the normalization coefficient, and the total phase is 

= In 2/e/'. (13,76) 

Settii]^ Y=^^^ (13.76), we obtain the asymptotic form of the radial 

function for the case of free motion: 


R\ 


Csm (kr — ^ A 


(13.77) 


Thus the potential energy in a central field of force is taken into 
account by means of the phase shift 

r 

8, = - 5 |A*’-(/- 14 )V’ dr + 

'o 

r 

4- \ Y V'(^)-4f (13.78) 


where is the root of the first integrand. 

In the special case of a Coulomb field, this phase is given by 
Eq. (13.76). The presence of the logarithmic term depending on r 
in the phase shift is the result of the long-range character of 
electrostatic forces, which can influence the particle even at a 
very large r. 

The phase shifts are functions not only of I, but also of 
(that is, of the energy £' = -^ ). They are an essential character- 
istic of the eigenfunctions of the continuous spectrum. The value 
of the phase shift cannot be found in a general form. The quantity 
has to be determined separately for each specific problem, 
usually by approximation methods. The phase shifts are of particu- 
lar importance in scattering problems, where they are used to 
express the effective cross sections (see Chapter 29). 

The wave functions of a continuous spectrum are normalized in 
terms of a 5 function. The expression (13.75) for the wave function 
is valid practically in all space; there is only a small region near 
the center where this expression takes a somewhat different form. 
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In evaluating the coefficient C we may, therefore, assume that Eq. 
(13.75) gives a correct expression for the wave function; that is, 
valid in all space. In this case, the wave function is normalized by 
the relation 


/= f Ri{k)Rt{k')r^dr = ^{k-k'). 

We substitute the expression (13.75) for Riik) and neglect the 
logarithmic term in the expression for the phase shift 8^ because 
it increases slowly compared with r. Consequently, we find 

oo oo 

I = cosr(k-k')dr-^ ^ cos [r (* + *')- + 23,] rfr 

Q 6 

(ft -ft'), 

and the value of the normalization coefficient in the asymptotic 
solution for the central field of force is 

C=jA|. (13.79) 

An energy level diagram for the hydrogen atom, showing both 
discrete levels and the continuous spectrum, is given in Fig. 13.3. 
This diagram clearly shows the degeneracy with respect to /, 
which results in a merging of all levels with the same n into a 
single level. 

Besides the ordinary transitions between discrete levels, two 
other processes can take place— ionization and capture. Basically, 
each of these processes is the reverse of the other. In ionization, an 
electron jumps from a discrete level (E<0), such as, for example, 
the ground state, to the region of positive energies (^>0) which 
forms a continuous spectrum (hyperbolic orbits). This process 
involves the absorption of energy. Conversely, in the process of 
electron capture, a free electron jumps into one of the possible 
discrete levels, at the same time liberating a corresponding amount 
of energy. 

A certain amount of energy is required to transfer an electron 
from the ground state («==I) to the region E'^0. This energy is 
given by (see Fig. 13.3) 

where T = —^ is the kinetic energy of an electron which is no 

longer bound to the nucleus. The energy represents the ioniza- 
tion energy of the atom. It is at its minimum when T = 0; this cor- 
responds to the minimum energy (E = 0) transition of an electron 
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from the level n = \ to the continuous spectrum. As a result of 
this transition, the electron can leave the atom. For the hydrogen 
atom 


13.59 ev. 


E. CALCULATION OF THE EFFECTS OF THE 
MOTION OF THE NUCLEUS 

In developing the theory of hydrogen-like atom; we have until 
now ignored the motion of the nucleus. Accordingly, our theory is 
rigorous only for the case of infinitely large nuclear mass. 
In general, this is a relatively rough approximation, particularly 
in the case of light elements such as hydrogen and helium. By 
taking into account the motion of the nucleus, it is possible to 
explain a number of Important experimental facts. 

We shall allow for the effects of the nuclear mass M in the 
same way as we did in our discussion of the spectra of diatomic 
molecules (Chapter 12), replacing everywhere the electron mass 
mo by the reduced mass 


'"red — (13.80) 

M 

The Rydberg constant then becomes 

3 • ( 13 - 81 ) 

As a consequence of this change, the term values are slightly 
shifted: 

(n/) = ?^(l_^). (13.82) 


The radiation frequency Is therefore given by the relationship 


'»nn’ = ^^Rco 




(13.83) 


which differs from the previous one [see Eq. (13.52)] by the factor 



Ml' 


Since the frequency of the radiation depends on the nuclear mass 
M, atomic weights canalso be determinedby spectroscopic methods, 
as well as by conventional chemical methods. One successful 
outcome of the application of the spectroscopic method was the 
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proof of the existence of heavy hydrogen and ionized hellium. Pre- 
viously, the average atomic weight of hydrogen (relative to oxygen) 
was found by chemical means. The mass spectrograph made it 
possible to measure the atomic weight of each atom. These 
measurements gave somewhat different values for the atomic mass; 

0.0 145 V,. (13,84) 

^chem 


Hydrogen (h') 


This led Birge and Menzel to predict the existence of a hydrogen 
isotope, called deuterium or heavy hydrogen, with anatomic weight 
twice that of hydrogen (D=,H^). The presence of deuterium in 
natural hydrogen explains the greater atomic weight obtained in 
chemical measurements. The mass spectrograph measures the 
atomic weight of tH' along, wince the spectral lines of iH* atoms 
fall at a different place on the scale. 

Deuterium, like hydrogen, can 
enter into reactions forming, for 
example, heavy water DA discov- 
ered by Urey and Osborn in 1932. 

Deuterium is usually obtained via 
the electrolytic decomposition of 
water. The rate of evolution of or- 
dinary hydrogen, at the cathode, 
greatly exceeds the rate of evolu- 
tion of deuterium; thus, the concen- 
tration of deuterium in the residual 
electrolyte increases. (It is almost 
impossible to detect deuterium in 

natural water because of its low concentration.) The presence of 
deuterium was confirmed by spectroscopic studies which showed 
that not only does the Balmer series (n' = 2) consist of the lines 



Fig, 13.5. Diagram of the relative 
position of the spectral lines of 
hydrogen and its isotopes. 


-T8To){?-^?)- (13.85) 

but that each of these lines is associated with a second line situ- 
ated somewhat to the right. This second series of lines (see Fig. 
13.5) is described by the equation® 

wd ~ (13.86) 


® According to the latest experimental data, 

floo = 2nc • 109737. «h = 27rc • 109678, Ru = 2Trc • 109707, 
where the numbers represent the values of R^p (see footnote, page 20). 
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which can be readily obtained from (13,83) by setting the mass M 
equal to twice the mass of the hydrogen nucleus and substituting 
Z = 1 It is worth noting that the large relative difference between 
the masses of the deuterium and hydrogen atoms causes far 
greater differences in their physical and chemical properties than 
is usual with isotopes of other elements. Thus, although heavy 
water outwardly resembles ordinary water, its melting and boiling 
points are 3.81 °C and 101.4 respectively, its viscosity is 
greater and it is a poorer solvent for salts. With the development 
of nuclear physics, heavy water has become particularly import- 
ant because it is a good moderator for fast neutrons, and can also 
be used as a source of deuterium. 

At present, we know of a third hydrogen isotope, namely tritrium 
= whose nucleus consists of two neutrons and one proton. It 
forms a compound with oxygen similar to water. The ratio of 
tritium atoms to iH’ atoms in natural water is approximately 10~*% 
whereas the ratio of iD‘^ atoms to ill' atoms is 1/6800. In a mixture 
with deuterium, tritium is a very important substance for the pro- 
duction of thermonuclear reactions: the reaction between and 
iT' nuclei leads to the formation of 2 ^ 0 * and one neutron. Each 
such reaction releases more than 17 Mev of energy. Tritium is 
also a beta emitter (with a half-life of 12 years) and consequently 
is widely used as a radioactive indicator in chemical and biological 
investigations. The positions of spectral lines of tritium are slightly 
displaced relative to the hydrogen and deuterium lines (see Fig. 
13.5) and are given by the equation 

Another very important consequence of accounting for the 
motion of the nucleus was the discovery of ionized helium, first 
detected in spectroscopic studies of the sun. The solar spectrum 
was found to contain a series of lines, with positions described by 
the equation 



(13.88) 


j 

Energy is released In the fusion of deuterium and tritium into helium, just as in the 
fission of U or Pu under neutron excitation. Fusion, however, is possible only 
if the potential barrier of the Coulomb repulsion between the D and T nuclei is overcome. 
High temperatures 10^ degrees) are therefore required if this reaction is to be self-sus- 
taining, whereas in the case of fission even low-energy neutrons can easily penetrate the 
nucleus, and that even at low temperatures. To obtain a thermonuclear reaction, a mixture 
of deuterium and tritium must first be heated to a temperature of tens of millions of degrees. 
Such temperatures may be created in an atomic explosion. 
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where rii takes the values 


n, = -|. 3. 4. I (13.89) 

This series is, in effect, the Balmer hydrogen series (/ 2 i = 3, 4, 5,..,) 
with a number of intermediate lines, which form the so-called 
Pickering series, characterized by the half-integral quantum 
numbers ni= 5/2, 7/2, 9/2, .... At first, the Pickering series 
was explained by assuming that hydrogen was in a special state 
in the Sun, so that the quantum number n could assume half- 
integral values. The spectral lines, however, were later found to 
be located further to the right than is indicated by Eq. (13.89), 
and consequently this assumption had to be abandoned. The second 
hypothesis assumed that the observed spectrum arises from singly 
ionized helium , whose nuclear mass is M — 7360/72o 

whose charge is Z — 2. According to (13.83), its radiation fre- 
quencies then are 

<0„o = 2^/?Uo(^ — 

Setting /i' = 4, we reduce (13.90) to the form 

(ir)’ 


WHc = 


(i 


(13.90) 


(13.91) 


where /i==5, 6, 7, 8, • . . . 

To answer the question of whether the Pickering series was 
due to the radiation of hydrogen atoms (under the assumption that 
the quantum numbers may assume half-integral values) or to the 
radiation of ionized helium atoms (with the usual integral values of 
the quantum numbers), it was necessary to find an experimental 
value of the Rydberg constant. In the case of hydrogen, 

(13.92) 

whereas for helium 

/?He = /?o»(l -4o)- 

Careful spectroscopic studies confirmed that the Rydberg constant 
has the value (13.93), and thus it was shown that the Pickering 
series represents the spectrum of ionized helium. 
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VProbZem 13.1^ Starting with Eqs. (13.26) and (13.28)^ show that the radial wave fimc- 
tioryS 18P'M8"{!rincipal quantunr) numbers « = I, 2 and 3 are as follows: 




2 /2 


R,t- 


2 /6 


Afe-f/*(2-p), 


Rto - 


9 Ya 




■ 


9^6 


We-P/2p(4_p), 


/e„= y^Ne-f'Y, 
9 /30 ’ 


where 


A/=(iy'*a„d p = ?^^ 

\aol noo 

On the basis of these specific examples, show that the funcuons are orthonormal, 
that is. 





Hint, In proving orthonormality, make use of the fact that at different n and the 
corresponding values for p also differ from one another. 



13. y Show that the average electrostatic potential produced by a hydrogen-* 
ground state is 


.^0 (Z^l) 




'2Jr 
’ oo 




tihel)' 


Hint. Find the average electrostatic potential 
— ^0 ^ (r') I (r') and add it to die 

over the solid angle 2', use the identity 


produced by the electron ^ (r) = 
potential of the nucleus. Integrating 




: 47: 


— , r' <■ r, 
r ' ' 


p, r':>r. 



111 * Z = 1, the maximum of the probability density distn- 
butiori D = r^R* in the states Is, 2p and 3d occurs at distances of 4oo and q£/„ from 
the nucleus (circular orbits), VVhy do the 25, 35 and 3/; states have several maxima 
(see Fig. 13,6)? 
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mr)\ 




D(r) 


3p D(n 


3d 



r=^ao 




r--9a„ 


r 


Fig, 13,6, Graph of the radial probability density distribution D = 
for various states. 


f^roblem 

lonsTCr^T" 


lem 13,4^ 
1^ a 


Using the functions given in Problem 13,1, verify the following equa- 
and 3: 




/ (/ -f- 0 


)]. 


and 


Find the spread (Ar)^ =zr^ of the radial deviations for these states. 

On the basis of Fig. 13,6 and the uncertainty principle, explain why (Ar)“does not 
/anlsh for circular orbits in the quantum theory, 

^ means of the quasi-classlcal WKB method, find the discrete energy 

spectrum of the hydrogen-like atom. 

Solution, According to (5.75) and (13.62), the eigenvalues of the discrete spectrum 
[E < 0) may be found from the equation 



rt 


where ri and r» are roots of the integrand and 

2/7Io/: ^__mQZel 

flit • 


Since the value of this integral is^ 



ve obtain exactly the same expression for the energy of the hydrogen-like atom as via 
he Schrodinger theory. 


®This integral can be easily evaluated from (13 64) by setting a~-~A, b - 2B and 
^ — (/ + 1/2)^, and regarding rj and r 2 rj as roots of the equation f (r) — 0 The logarithm 
tf the complex quantity [assuming that a tends to zero from the direction of positive num- 
>ers (a. = \ 0)1 may be taken as: 

^ , iiO/3^0, 


In (/3 - la) 



1277 0 , 

ITT P < 0 
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_ Problem 13,6J Determine that magneac field intensity at the center of the hydrogen 
^111 viMUi ly TiSe to the orbital mouon of an electron. Find its numerical value for the 

2p state. 

Answer. //^ = /y^ = 0, 




niaC 



(i+l) 


For the 2p state (m = 1) 


Hi ^ 10^ gauss. 


Hint. lake the classical expression H = x p for the intensity of the 

magnetiv field produced by a moving charge. Transform to a quantum treatment of the 
problem by replacing r x p by the angular momentum operator L, Then, using Eq, 
(13,29a), calculate the average value of//. 



Chapter 14 


Time-Independent Perturbation Theory 


(AS BASIC PRINCIPLES OF THE PERTURBATION 
^ TREATMENT OF PROBLEMS 

A relatively large number of problems in quantum mechanics 
cannot be solved exactly with present-day mathematical methods. 
Thus, various approximate calculations have to be used. One of 
the most widely used approximations is the perturbation theory 
method, which was first developed to handle problems in celestial 
mechanics. It is well known that in Newtonian mechanics it is pos- 
sible to solve exactly only the two-body problem (for example, the 
Earth-Sun or the Moon-Earth problems). We cannot, however, ne- 
glect interplanetary forces and consider only the attraction of the 
planets by the Sun since many delicate phenomena are associated 
with these additional interactions (it is worth recalling in this con- 
nection that Leverrier predicted the existence of Neptune on the 
basis of the orbital deviations of Jupiter, after which the planet was 
discovered by astronomers). It thus became necessary to consider 
the many-body problem, which has no exact solution in classical 
mechanics. In celestial mechanics it was found that the perturba- 
tion problems could be handled by means of an approximation based 
on the fact that the forces between the planets are much smaller 
than the force of attraction to the Sun, In this method, one starts 
by solving the two-body problem (the zero-order approximation), 
then takes the ‘‘perturbation’* into account and finds the correction 
to the solution (the first-order approximation). In other words, the 
“perturbation method” involves taking the principal forces acting 
on a body, finding the rigorous solution for these forces, and then 
taking the “perturbing” forces into account. 

Similarly, in the quantum-mechanical treatment of the motion 
of several electrons in an atom, it is necessary to consider first 
the principal forces, such as, for example, the force between the 
nucleus and an electron. In this case, the perturbing forces may 
be taken to be the Coulomb forces of mutual repulsion between the 
electrons. In the problem of an atom subjected to an external 
electric or magnetic field whose strength is small relative to the 
electric field of the nucleus, the perturbation may be taken to be 
the energy of the electron in the external field. 
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(B.l FUNDAMENTAL EQUATIONS OF PERTURBATION 
^ THEORY 

We shall now develop the perturbation theory in a form suitable 
for stationary problems, that is, problems in which the Hamiltonian 
of the system does not depend on time. Suppose the Hamiltonian 
has the form 

H = T + l/ = T + l/» + I/', (14.1) 

where the perturbation energy V' < F®, and the main part of the 
potential energy F® is chosen in such a manner that the Schrbdinger 
equation of the system 

(E — H)^> = 0 (14.2) 

has an exact solution, characterized by E®and t])®, when the pertur- 
bation F' is neglected. Setting T + F® = H® (the zero-order approx- 
imation) and using Eq. (14.1), we can write (14.2) as 


(£_H®-~F')4^ = 0. (14.2a) 

The basic problem in perturbation theory is to calculate from this 
equation the energy values £„ and the corresponding eigenfunctions 
The solutions are sought in the form of the series 

= + f + (14.3) 

E==E® + £' + £" + ..., (14.4) 

where and F are terms of the first order of smallness relative 
to and E" are terms of the second order of smallness; 

and so on. 

As a rule, the perturbation energy F' can be represented in the 
form of a potential energy of the same order as F®, multiplied by 
some small parameter 1(1<<I). The solutions (14.3) and (14.4) 
should then appear as expansions in terms of X. Thus, E® and d/ 
will be independent of this parameter, E' and Y propor- 

tional to X, E" and Y' to XS and so on. In the expressions for ^ and 
E, let us restrict ourselves to terms of the first order of small- 
ness (that is, we shall retain only terms which are independent of 
X or directly proportional to X). Substituting (14.3) and (14.4) into 
(14.2a), we obtain the following equation for and E': 

(P E' — H® - F') (f + f ) = 0. (14.2b) 

Collecting terms of the same order, we have 


(E® - H®) f + [(F ~ F') + (E® - H®) t'l + (F~. F') f =0. (14.2c) 
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Equation (14.2c) may be regarded as an exact equation, since we 
have not neglected any terms in it, and since <1>' and £' may be taken 
to represent the sums of all terms of different orders of smallness 
(that is , f f + f ' + F ->£' + E" + .. .). To obtain the first- 

order approximation of perturbation theory, we neglect the second- 
order term (F — V') ^ in (14.2c) and use the equation for the zero- 
order approximation 

(F — H»)f = 0. (14.5) 

This equation yields the zero-order eigenvalues 

po po pn po 

•••» ^nt 

and eigenfunctions 

fu ru ...» ...» 

which are connected by the relationship 

(£J.~H«)tl>;V = 0. (14.6) 

Keeping all this in mind, we shall now investigate the equation for 
the first-order approximation of perturbation theory 

(F — H^’) - (F — V') f . (14.7) 

We assume that at the beginning, the system is in the quantum 

state n'=:^n Since and = in the zero-order approxima- 

tion, and E'=^E'n and^ — '’^; in the first-order approximation, Eq. 
(14.7) becomes 

(El - H'’) = -(£,- r) 'K. (14.7a) 

We recall that an arbitrary function can be expanded in a series 
of orthonormal functions forming a complete set and satisfying the 
same boundary conditions as the original function; therefore, we 
may assume the solution for to be of the form 

= (14.8) 

n* 

Our problem, therefore, reduces to determination of the imknown 
coefficients C„. of a generalized Fourier series. Substituting (14.8) 
into (14.7a), we have 

(£» - H«) = -(£;- V) rn, 


(14.9) 
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or, taking into account (14.6), we find 

2 Cn- rn' = -{En- V') fn- (14.9a) 

n' 


NONDEGENERATE CASE 

Let us suppose that the system is nondegenerate. Therefore, 
each energy eigenvalue £„ corresponds to one and only one eigen- 
function Then, multiplying Eq. (14.9a) on the left by and 
integrating over all space, we have 

2 Cn’ (Ei - F“n-) Kn- = - £; + J V’fn d'X. (14. 10) 



In obtaining this equation, we have made use of the orthonormality 
of the eigenfunctions 


Since the left-hand side of Eq. (14.10) is equal to zero {El — El' = Q 
for n' — n, and — 0 for n’ n), the energy correction El is 

El = V'n„, (14.11) 

where 

Vln=^'^fn*V'f„(Px. (14.11a) 

Thus, the energy correction El of the system quite naturally turns 
out to be equal to the average value of the perturbation energy V. 

It is worth noting that the expression (14.11) for the energy 
correction El was obtained by setting the left-hand side of Eq. 
[14.7a) equal to zero after it had been multiplied by tn* and in- 
tegrated over all space. Since is a solution of the homogeneous 
aquation (14.6), it follows that the right-hand side of the inhomo- 
geneous equation 

Mt = / (14.12) 

is orthogonal to the solution of the corresponding homogeneous 
equation Mij»“=0, that is, 


J (l<''7d’je = 0. 


(14,13) 
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Let US now proceed to determine the wave functions (that is, 
the coefficients €„ ) of the Schrodinger equation (14.7a) in first- 
order perturbation theory. We write Eq. (14.9a) as 

2 C„.. - £J..) =-(£;- V") 


Multiplying on the left by (n' j!: n) , using the orthonormality 
condition, and integrating over all space, we obtain 


c„.= 


K'n 


■Ei. 


where 




Thus , for we have 

n' 


(14.14) 


(14.15) 


(14.16) 


where the prime on the summation symbol indicates that the sum 
is taken over all a' except = Finally, the as yet undetermined 
coefficient of the zero-order wave function can be found from 
the normalization condition 



(14.17) 


for the total wave function 

= W + -i-i = + 2' (14. 18) 

n' 


where 


CS=I+C„. (14.19) 

' Substituting (14.18) into (14.17) and keeping only terms up to 
the first order, we have 


ic:i» J r*rnd^x+ 

+ 2'{crc„. J cPx + J = 1 . (14.20) 

n' 
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Making use of the orthonormality condition and ignoring the phase 
factor, which is of no interest to us, we obtain 

CJ=1. (14.21) 


and therefore C„ = 0* 

Consequently, for the wave function in the first-order 
perturbation theory, we finally obtain 

■j'n ===■}>;; +2' (14.22) 

n' 

From Eqs. (14.22) and (14.11), it can be seen that both '1^,1 and 
E’n are proportional to the first power of the perturbation energy 
(that is, to the parameter a). If we were to compute the corrections 
to the energy and the wave function in second-order perturbation 
theory, both En and would turn out to be proportional to the 
second power of F'(that is, to X®). 


d. degenerate case 

We shall now develop the perturbation theory in a form applica- 
ble to degenerate systems, in which, in the absence of any per- 
turbation, a given energy eigenvalue El has associated with it j 
eigenfunctions ^ y \v ^ 

Ym* 

It is obvious that any linear combination of these functions 

J 

(14.23) 

/x-1 


is itself a solution of the wave equation in the zero-order approxi- 
mation 


All vviii«^ii me eigen veil u.e ei me eiiex id 

If there is a perturbation V\ this arbitrariness is removed and 
the coefficients Ol may become connected with one another by 
certain definite relationships. Let us show this, proving first that, 
just as in the case of nondegenerate states, any particular solution 
of the homogeneous equation (14.6) is orthogonal to the right- 
hand side of the inhomogeneous equation (14.7a) of first-order 
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perturbation theory* For this purpose, we multiply (14,7a) on the 
left by and integrate over all space. We then obtain 

J (Ek - H“) d'x = - J (E'n - V) (Px. (14.24) 

Using the theorem for the transfer of a derivative [see Eq. (7.17)], 

we have 

J f (Ek - H") d'x=- J .[-r, (E'n - V’) Px. (14.25) 

Since is a solution of the Schrodinger equation (En — H") = 0, 

we arrive at the equation 

J (Ek - V) 2 (Px = 0 (14.26) 

l'--l 

Without any restriction on the generality, we may assume that all 
the eigenfunctions are orthonormal. ^ Then, since 

we obtain instead of (14.26) 


a (E„ - k ;,) = 2 

i'=i 

where 

V'n^^rnlV^Px, (14.28) 

Vn’==\^P,V'^ki.cPx, (14.29) 

and the prime on the summation sign indicates that the sum extends 
over all z'^s, except /' = z. Since the subscript / in (14.27) can take 
any value from 1 to /, we have a system of j homogeneous equations 
from which we can determine the energy E’n and the coefficients C? : 

C\(En-V[,)- CWu - ... - C;Vi = 0. 

- + C'l (£„ - K,,) - ... - C)r,; = 0. 


- CIV, I - Cii/;, _ . . . 4- c; (e; - k;,) = 0. 


(14.30) 


hf the functions 0® , are not orthonormal, it is always possible to construct from them 
by means of linear transformations, new functions which possess this property. 
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Recalling that the wave function must satisfy the normalization 
condition 



(14.17a) 


we see that the correction E'n to the energy of the unperturbed 
state En of the system is uniquely determined, as are the coeffic- 
ients Ci (and hence alsotj^J). Since, in particular, the system of 
equations (14.30) can have a nontrival solution only if its determi- 
nant is equal to zero, we have the following equation for E’ni 


(En — ^ H* 


— ^21* — V^22)» 


-Vl-V'i, 



(14.31) 


This equation is called the secular equation, a term taken from 
celestial mechanics. - ' 

If the secular equation has several roots (not necessarily / 
different roots), each one of them will correspond to a completely 
determined set of coefficients CJ, which can be found from (14.17a) 
and (14.30) by substituting the given root E'm^ for E'n . Consequently, 
the different corrections En to the energy lead to different zero- 
order wave functions. Thus, if a system is /-fold degenerate in the 
absence of any perturbation, a possible effect of a perturbation is 
to reduce or completely remove the degeneracy [this will occur in 
the case when Eq. (14.31) has / different roots]. 


E. THE STARK EFFECT 

If an atom is placed in an electric field, its spectral lines are 
generally split into components. This phenomenon was discovered 
by Stark in 1913. Experiments have shown that the effect of an 
electric field on a hydrogen atom is different than that on other 
atoms. In a weak field, the splitting of the energy levels of hydro- 
gen (for example, the Balmer series) is proportional to the first 
power of the field intensity (the linear Stark effect) , whereas for 
all other atoms the splitting is proportional to the second power of 
the field intensity (the quadratic Stark effect). In a stronger field 
(of the order of 10^ volts/cm), there is an additional splitting ef- 
fect, proportional to higher powers of the field intensity (second 
power in the case of hydrogen). In very strong electric fields, the 
spectral lines disappear completely. 

The Stark effect could not be explained before the development 
of quantum mechanics. In terms of classical concepts, the motion of 
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an electron in an atom can always be resolved into three mutually 
orthogonal oscillations. Let us consider the oscillation along the 
z axis , which can be taken to coincide with the direction of the 
constant electric field E{E^ = E^ = 0, E^ = g) The equation describ- 
ing the motion of the electron along the 2 axis^ is (for e = ~e^) 

tYi^z -|— (14.32) 

where nh is the electronic mass and <oo is the angular frequency of 
oscillations. It is readily seen that the solution of Eq. (14.32) is 

2 = - -Sf + ^ 

It is clear that in classical theory the only effect of the constant 
force ( — ^ 0 ©) Is to change the position of the point of equilibrium 
of the system. The frequency of oscillations remains completely 
unaffected. Consequently, classical concepts imply that the fre- 
quency of the radiation emitted by an atom remains the same 
whether or not the atom is placed in an electric field, since the 
frequency of the radiation is the same as the mechanical frequency 
of oscillations of the atomic electrons. Accordingly, an electric 
field cannot produce any shift of the spectral lines in classical 
theory. 

We shall now consider the Stark effect in terms of quantum 
concepts. As we have just mentioned, there are two basic forms 
of the Stark effect: namely, linear and nonlinear. The linear effect 
is characteristic only of hydrogen-like atoms because they are 
degenerate not only with respect to the magnetic quantum number 
m, but also with respect to the orbital angular momentum quantum 
number I (see Chapter 13), with which the linear Stark effect is 
associated. In all other atoms, there is no degeneracy with respect 
to /, and therefore the linear Stark effect is not observed. 

Let us discuss in greater detail the theory of the linear Stark 
effect for the hydrogen atom. We shall confine our treatment to 
the second quantum level (n ==2). ^ The external electric field 
(which is of the order of 10^— 10* volts/ cm in experiments) is much 
weaker than the intraatomic field produced by the nucleus, which is 

g = 5 . 1 0® volts/cm, 

nucl fl(| 

where «« is the radius of the first Bohr orbit. Therefore, we may 
use the results of perturbation theory (in the form developed for 


^The electric field intensity will have no effect on the oscillations along the x and y 
axes, which are perpendicular to E. 

3 

The first quantum level (n - 1) is nondegenerate and, therefore, is not split. 
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degenerate problems). For the perturbation V', we must take the 
potential energy of the electron in the external electric field 

V' = eo&z. (14.34) 

In the unperturbed state, the energy of the electron is [see Eq. 
(13.33)] 


which has four wave functions associated with it: 


1^2.0. 0 ^20 ('') 1^0 (^), 

(14.35) 

t: = '{’i. 1,0 = ^2i (r) Y't = y /- Ri, (r) cos », 

(14.36) 

■^5 = 1 = R21 (r) r; = yi R„ (r) e-^, 

(14.37) 

= = yiR-n(r) 

(14.38) 


When 0 and are replaced by Cartesian coordinates, these wave 
functions take the form: 


'I'i' = /i (f). 

(14.35a) 

fi=fiir) z. 

(14.36a) 


(14.37a) 


(14.38a) 


(14.39) 


where 
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The general wave function of an electron will have the form 

4 

(14.40) 

isg»\ 


Since the degree of degeneracy is four (/=:4). we have the 
following system of four equations for determining the unknown 
coefficients E\ and the corrections Q to the energy of the un- 
perturbed state: 

c; (^; ~ V'i.) - qv;, - qvia ~ = o, 

— q^^ai"f“q(q q^^w ciVqi=o, _ . 

— QV', ^ qv;, + q (q - Vn) - civ:,, = o, 

— q v'ji — qv^49 — q^ la “f* q (q — ^44)“ Oi 


where 


v:u= J f/V'f,d^X=: 

— (14.42) 

When we integrate over the volume, the matrix elements 


^ 111 V2'2, V33, ^^24* ^31 » ^32 » t^34» ^41 » ^42 V43 


vanish, since for each of them the integrand must be an odd func- 
tion with respect to at least one of the coordinates x, y and z. Only 
the matrix elements 


V 19 and V 91 — • V 12 , 

which are even functions of all three coordinates, differ from zero 


V[,= y;, = c. B J f, (r ) /, (r) 2 ’ d-'x. (14.43) 


Let us substitute the values of Eq, (14.39) for /i (r) and /i (/*) and 
note that, according to Eq. (13.28a), 


RiQ — 


2 /2 






_/• 

2ao 


Rn = 


2 ye 


re 


2ao 


and 
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Then, integrating (14.43) over the angles ^ and 9 (remembering 
that z = r cos 0), we obtain 

Ki, = V;. = J r‘ (2 - r ^ dr. (14.43a) 

Next, taking into account the equality 

00 

fe-Vrfp = r(s+l). 

0 

we obtain 

= = (14.44) 

Using the obtained values of the matrix elements V'y,, we find, 
in accordance with (14.31), that the secular equation for the correc- 
tions E'i is 


Ej 2a e„ g 
3a Co S E'l 
0 0 

0 0 


0 0 
0 0 
Ei 0 
0 E'i 


- 0 , 


which can be rewritten in the form 

£;-(/• r-9a;e;g^) = 0. 

This equation has four roots : 

E = 3ai(^o g, 

E'i''-' = 2a^e^%. 


(14.45) 


(14.45a) 


(14.46) 


each one of which, according to Eq. (14.41), is associated with a 
quite specific set of coefficients: 


p 1 ). 
'-I t 

C ( 2 ) /^i 2 ). 

1 f 

Cl*' = Ci’=0; 
C,‘’ = Ci” = 0; 


C>' = Q' = 0 

Q' = Ci“ = 0. 
C'a', Ci'9^0, 
C'i', 


(14.47) 


Here the superscript / of the coefficients Cf CJ indicates the so- 
lution (or root) of Eq. (14.45a) with which it is associated. Thus, it 
follows from (14.40) and (14.47) that the energy level 
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is associated in the zero-order approximation with the wave func- 
tion 


Because of the normalization condition 

this wave function becomes 

+ (14.49) 

In a similar fashion, it can be shown that the state with energy 

£^’ = £S + £;'‘' = -^ + 3a.e«S (14.50) 

is described by the zero-order wave function 

(14,51) 

To describe the states with energy 

£ '<S) p(4) po 

4 ^2 

which remain unperturbed by the electric field in the first-order 
approximation, we may equally well take the function 

or 

or, alternatively, we may take a linear combination of these func- 
tions, since the system remains degenerate for m = 1 even in the 

presence of an electric field. Thus, when the z component of the 
angular momentum is not equal to zero {m = ±l in units of /^), that 
is, the electron is moving largely in the xy plane (see Chapter 12, 
Section A), there will be no splitting of the energy levels (and, 
therefore, of spectral lines) in an electric field. If, on the other 
hand, the orientation of the angular momentum is such that its ^ 
component is equal to zero m = 0 and, consequently, the electron 
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is moving in a plane which includes the z axis (again, see Chapter 
12, Section A), then an electric field does lead to splitting of the 
spectral lines (see Fig. 14.1). 


"" — " 

Rh 

^-3eoOoS 

a b 

14. I Splitting of the second spectral term of hydrogen in 
an electric field (the linear Stark effect): 

a) energy level in the absence of a field (£ -0), 

b) energy level in the presence of a field (S 


Qaulitatively, the Stark effect for n = 2 may be interpreted as 
follows. Since the wave function describing the motion of the 
electron for /i~2 (see Fig. 12.1) is not centrally symmetric, the 
atom has a certain electric moment p. Consequently, when the 
atom is placed in an electric field 

(£^==£^ = 0, £, = g), 

it acquires an additional energy 

y ' ^ (pE) = - p S cos Y. (14. 52) 

where y is the angle between the direction of the electric dipole 
moment of the atom and the z axis. Comparing this expression 
with (14.46), we see that the electric dipole moment of the atom 
is p = 3a,,e’o. The solution corresponds to the case y = 0 and 
the solution the case y = '^» For the third and fourth solu- 

tions, it is necessary to set y = ^^. We note that, in the last case, 

the electric dipole moment is oriented perpendicularly to the elec- 
tric field and, consequently, no additional energy appears. To con- 
clude this discussion, the linear Stark effect arises because of the 
intrinsic electric dipole moment of the hydrogen atom at n==2. 
Predictions obtained on the basis of quantum mechanics are in 
good agreement with experimental data only in the case of weak 
fields (10* volts/cm). At higher field intensities (^^10'* volts/cm), 
there is an additional splitting (the quadratic Stark effect) due to 
the removal of the degeneracy with respect to the magnetic quan- 
tum number m. Finally, at field intensities greater than 10’’ volts/ 
cm, the Stark effect completely disappears. This is the result of 
autoionization of the atoms, that is, removal of electrons from the 
excited levels. 
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F* PRINCIPLES OF THE CLASSICAL THEORY OF 
DISPERSION 

Perturbation theory has many important applications in studies 
of the interaction between light and matter. The predictions obtained 
in this way differ from classical results and receive excellent 
confirmation from experiment. This section is concerned with the 
classical theory of dispersion (that is, the scattering of light) in 
a dielectric medium. According to classical notions, a dielectric 
is characterized by the index of refraction 


n = V e. 


where £ is the dielectric permittivity (the magnetic permeability 
(X is taken to be equal to unity). If the index of refraction n becorries 

larger as the frequency of light increases (that is, ^]>0),the 

dispersion is said to be normal. A typical example of normal 
dispersion is the spectral resolution of white light by a glass or 
quartz prism (the deflection of violet rays from their initial direc- 
tion is larger than that of redrays). If, however, ^ <^0in a certain 

range of frequencies, the dispersion in this region is said to be 
anomalous. As a rule, anomalous dispersion occurs at frequencies 
at which light is absorbed by the medium. 

To determine the index of refraction (one of the most important 
problems in the theory of dispersion), we use the equation relating 
the electric field intensity £, the displacement vector Z>, and the 
polarization vector P'* 


Z) = e£ = £+4TrP. 

Since e = /^^ we have 


P= 



(14.53) 

(14.53a) 


Thus, to determine n we are required to find the relationship 
between P and E on the basis of the microscopic picture of the 
structure of matter. We shall be better able to appreciate the 
contribution of quantum theory in this connection after we have 
completed our review of the basic principles of the classical theory 
of dispersion. 

According to the Lorentz classical theory, atoms may be 
regarded as harmonic oscillators in which, in the simplest case, 
all the electrons oscillate with the same angular frequency u). If 


4 


The polarization P is defined as the total dipole moment of the atoms per unit volume 
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we take the z axis to be parallel to the direction of propagation of 
the electromagnetic wave, then, since the wave is transverse, we 
may direct vector E along the x axis (£j^ = g, £^ = £^ = 0), and 
vector H along the y axis. If we neglect the force exerted on the 
atomic electrons by the magnetic field (since the magnitude of this 

force is only a fraction (7*^ of the force exerted by the electric 

field), the oscillation of the electrons can be described by the 
equation 

moJS + niQiolx — — ^og. (14. 54) 

We shall assume that the frequency of the incident light wave is o). 
Then 


8 = g, cos {mt — . ( 14 . 55 ) 

If the energy transported by the wave is much smaller than the 
bonding energy of the electrons in the atom, it follows that the 

ratio ~ may be neglected because it is small compared to the 

atomic dimensions a'^ 10 ® cm, while the wavelength X is of the 
order of cm). Thus, under the assumptions we have made, the 
electric field of the wave may be considered to be quasi- stationary 
inside the atom. As a result of this simplification, Eq. (14.54) 
becomes 

f (ojjc = — --pg = — COS («/ . (14.54a) 

fi»0 

Multiplying (14.54a) by ( — e^N), where N is the number of atoms 
per imit volume, and substituting 

--e^=p^, and P^z=Np^=€^, Py = P^=0, 

we reduce this equation to the form 


^ + «> 5 i^ = “ So cos u)/, 


from which we obtain 


5 * 


fflo 



go cos wt. 


(14.54b) 


(14.56) 


Comparing Eqs. (14.56) and (14.53a), we obtain an equation for the 
index of refraction which should be familiar from optics: 

^ — s A/ ~ ^ 

m, “J — »* 


(14.57) 
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We note that if the atom is assumed to contain electrons with 
different eigenfrequencies 


(Do, (Di, (Dj, ...» (D;^ , ...» 


a more general equation is obtained instead of (14.57): 


4xt 



(14.57a) 


where is the number of electrons per unit volume oscillating 
with the frequency 

It follows from (14.57) that at radio frequencies (<o<<(Do)the 
index of refraction may be assumed to be constant, without signif- 
icantly affecting the accuracy. Its value is given by the relation 


1 Nel 
4ic ^ * 


(14.58) 


On the other hand, for frequencies <o>>( 0 o, the value of the index of 
refraction is given by the relation 


yt -— Nel 
4Tt ^ mow* * 


(14.59) 


The index of refraction is, therefore, a constant greater than unity 
for (D < to,) , whereas for to to^ it is less than unity, approaching it 
as 0 ) — CO. 

At frequencies close to wq, the magnitude of the index of refrac- 
tion increases without limit, and at w = a)o it has a discontinuity 
(see Fig. 14,2). The reason for this be- 
havior of the function is that Eq. (14. 54) 
does not include the radiation damping of 

the electron ^ x, which arises 

from the interaction between the moving 
electron and its own field. When fdamp is 
included, the dispersion curve has the form 
indicated by the dotted line in Fig. 14.2. 

Consequently, the dispersion is anomalous 
near the resonance frequency cd^. Since the 
region of anomalous dispersion coincides 
with the region of the eigenfrequencies of 
oscillation of electrons in the atom, it follows that anomalous 
dispersion is accompanied by strong absorption. 



persion curve 
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F. QUANTUM THEORY OF DISPERSION 

Let us now develop a quantum theory of dispersion. By analogy 
with the classical case, we shall assume that all the electrons in 
the atoms are in the same quantum state We shall use the 
perturbation method to solve our problem, since the energy of the 
interaction with the external field is generally small compared 
with the bonding energy of the electrons. In the nonrelativistic case 
(when we may neglect the “magnetic*’ force), the external force 
which acts on an electron can be obtained from Eq. (14.55)^: 

Px = — ^ 0^0 cos (*)/. Fy = F. = 0, 


The perturbation energy is given by 

F' == cos ( 1 )/. (14. 60) 

Consequently, the Schrodlnger equation for the electron is 

= (14.61) 

Let us suppose that Eq. (14.61) has an exact solution for V"=0: 

^]{(/)==<l;2e-w'»V (14.62) 


where '{'» and satisfy the equation 

(£»-H“)<|.?=0. (14.63) 

In accordance with perturbation theory, we shall lookfor a solution 
in the form 

<[,*(/) = <1,1 (0 + <|»i(0. (14.64) 

Since (14.62) and (14.63) yield 

= (14.63a) 


^This assumption is analogous to the assumption made in the classical treatment, 
according to which all the electrons have the same eigenfrequencies of oscillation. 

^Just as m the classical case, we have assumed here that the electric field is quasi- 
stationary over distances of the order of the dimensions of the atom. 
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the equation for (0 and the first-order correction to the energy 
£* is 


Substituting V' from (14.60), we have 

(— j 4- -- Vn (t) = \ {e~ "K-"' + e- «(“*+“) }. (14, 65a) 

To eliminate the time t from this equation, let us look for a solu- 
tion ^'k (t) in the form 

(0 = + ve- " . (14.66) 

We then have the following equations for the functions u and v : 

{/I ((i)^ — (o) — H«} w == y (14. 67) 

{n («,* + «0 - H"} i> = 1 eoXMt (14.68) 

We note that these two equations have exactly the same form. 
Consequently, it is only necessary to solve Eq. (14.67) for w, since 

the solution for v of Eq. (14o68) can be obtained from u by substi- 

tuting — to for w. Since the time does not appear explicitly in Eq. 
(14.67), we can find u by the perturbation method in the form ap- 
plicable to stationary problems. Thus we shall look for a solution 
in the form of an expansion in eigenfunctions of the unperturbed 
problem [see (14.8)]: 


u = (14.69) 

where the ^1" satisfy the equation 

= (14.70) 

Accordingly, we may reduce (14.57) to the form 

H 2 Ck" («>**'■ - •») f*" = -pi. 

*" 

where the frequency of radiation is 


(14.67a) 


(14,71) 
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Let US multiply (14.67a) on the left by and Integrate over all 
space, taking into account the orthonormality of the eigenfunctions 

= We then obtain the following equations for the 

coefficients and the function u : 




eo$o 




2a a> * 


0“ 

^ \ 2» / Uj,* + « ^ ’ 

where the matrix element is equal to 

Xli'l, = J 'I'rAT'l'ft d^X. 


(14.72) 

(14.73) 


(14.74) 


Substituting — o) for <0 in (14.73), we obtain an expression for the 
function v: 


v= 




"*’* — “ 




(14.75) 


From (14.64), (14.66), (14.73) and (14.75), it follows that the 
total wave function 'll* (t) is 




P— /W; 


-'{tJ' 




■[^k'kCOSint — itosino)/]!. (14.76) 


From the wave function t|#*(/) of the electron in the external 
field, we can readily obtain the polarization vector of the medium 
In the classical theory we had 

0^z=Np = — NbqX. 

To generalize this expression to the quantum case, we must replace 
p by its average value. Then 


S^=Np=-Neo (14.77) 

Substituting from (14.76) and retaining only first-order terms 
in go» we have 




2Nel 

n 




"i'*— 


go cos oof. 


(14.78) 
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In deriving this expression, we used the relation 

which follows from the fact that the integrand is an odd function of 
X. Comparing Eqs. (14.78) and (14.53a), we obtain the dispersion 
formula 


1 _ 2Nel \ 

4jt H uilf/j — to* 


By introducing the new variable 


fit'll = lOk'lt 1 Xk'li I’ 


(14.79) 


(14,80) 


which is called the oscillator force, we transform Eq. (14.79) to 


— I _ iVgg y fk'k 

47c mo toJ,|^— to* 


(14.81) 


Here let us make an observation similar to the one made in regard 
to the classical treatment: namely, if we had included the radiation 
damping in the quantum-mechanical treatment, we should have 
obtained a finite value of n® for frequencies w in the neighborhood 
of (see Fig. 14.3a, the dotted line). 



Fig. 14.3, Dispersion curves 

a) positive dispersion 

b) negative dispersion 


Equation (14.81) has a structure similar to the classical equa- 
tion (14.57). In actual fact, however, the quantum results are 
fundamentally different from the classical results. From quantum 
theory it follows that anomalous dispersion occurs in the neighbor- 
hood of frequencies corresponding to allowed transitions, and not, 
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as in classical theory, in the neighborhood of the eigenfrequencies 
of oscillation of the electrons. This particular conclusion can be 
seen to be directly related to the role of the oscillator force fk'k 
in (14.81), which is specified by the matrix element Xk'k [see (14.80)] 
which characterizes the selection rules (and thus the allowed 
transitions). This prediction of quantum theory was experimentally 
verified by D. S. Rozhdestvenskiy. 

A second, very important difference from the classical results is 
that quantum theory leads to negative dispersion (see Fig. 14.3b)— a 
phenomenon which has no classical analog. This can be under- 
stood by simply noting that when light is scattered by excited 
atoms, it is necessary to take into account the states with Ek'^Ek' 
for which 






For these states the dispersion formula (14.81) becomes 


■— 1 Nel^ Ifk'kl 

47t trio ^ 0)^,^ — oj2 > 


(14.81a) 


and the dispersion curve is represented by the dotted line in Fig. 
14.3b. The experimental discovery of negative dispersion was made 
by Ladenburg; thus, this prediction of quantum theory was also 
confirmed. 

Let us now find the value of the oscillator force fk'k and the 
dispersion formula for a harmonic oscillator. The only nonvanish- 
ing matrix elements in this case are [see (10.55)] 





and Xjj 



“By chance,’^ it turns out that the quantum-mechanical frequencies 
of radiation are identical with the eigenfrequencies of oscilla- 
tion 


= 3.nd 


We thus obtain 

= 1). — (14,82) 

Consequently 4inceV iV the dispersion formula (14,81) can 


be written as 
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Nel k __ Nel 
nh 


(14.83) 


We can see that in this particular problem the quantum and clas- 
sical theories yield the same value for the index of refraction ru 
The phenomenon of negative dispersion is not observed. The 
reason for this is that the regions of positive and negative disper- 
sion coincide since so that the stronger effect of 

positive dispersion masks the negative dispersion. 


G. RAMAN EFFECT 


Let us consider the phenomenon of dispersion from the stand- 
point of energy diagrams. Suppose that a photon with energy 


6 = /J(» 


(14.84) 


impinges on an atom with only three energy levels 
(see Fig. 14.4). In general, the scattering of this photon (that is, 
dispersion) will be a second-order effect. The first form which 
this process can take is absorp- 
tion of the photon. This is ac- / 

companied by excitation of an 

electron from level k to some 

intermediate state (which may 

even be a forbidden state^; see , ^ e 

Fig. 14.4,1) and, subsequently, ^ 

by emission of a photon. If, as ^ 

a result, the electron returns 

to its initial state, it follows 

from the law of conservation of H 

energy that the frequency «/ of 

the scattered photon is the same Fig. ha. Energy diagram for photon 

as the frequency o) of the inci- scattering 

dent photon. energy of the incident photon. 

Alternatively, the order of photo". 

i. 1 I anti n represent elastic scattering 

the process may be reversed: and iii and iv 

the atom first emits a photon represent induced transitions 

(see Fig. 14.4,11) and then ab 
sorbs the incident photon. As 

in the preceding case, the frequency m of the scattered photon will 
be equal to the frequency oj of the incident photon if the atom re- 
turns to its initial state. 


Fig. It. 4 . Energy diagram for photon 
scattering 

h(o IS the energy of the incident photon, 
h(0 , the energy of the scattered photon, 
I and II represent elastic scattering 
and hcv/- III and IV 

represent induced transitions {]%(*> 
or hu) 'V 


More precisely, the law of conservation of energy may be violated in intermediate 
states It IS required to hold only in the final result 
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fico 


fioj" 




1 



-A'* 


Finally, resonance occurs when (*)?!« (o/e**. in this case, both 
processes— scattering and absorption of the photons— take place 
(see Fig, 14.4, III); as a result of the last process the electrons in 
the atom undergo induced transitions. The probability of these 

transitions is given by the Einstein 
coefficient Bkk' [see (9.21)]. An 
external field increases the number 
of downward transitions (see Fig. 
14.4, IV), which results in some 
additional radiation proportional to 
the coefficient 

So far, we have been concerned 
with cases in which atoms return to 
their initial state after scattering. 
It may happen, however, that after 
the atom has absorbed the incident 
photon, the electron does not return 
from the intermediate state to the 
level k, but instead makes a transi- 
tion to the level k' or k" (see Fig. 14.5). In this case, the frequency 
of the scattered light (w' or w") is not equal to the frequency of the 
Incident light. This type of scattering is called the i^aman e//ec^, 
after the Indian physicist who first discovered this phenomenon in 
liquids. In solids the Raman effect was discovered by the Soviet 
physicists L. I. MandePshtam and G. S. Landsberg (1928). 

From Fig. 14.5, it can be seen that the frequency of the scattered 
photon may be either lower or higher than the frequency of the 
incident photon. In the former case, the lines 


Kig. 14i.5. The Raman effect. 
hw IS the energy of the incident 
photon, h(A)\ the energy of the scat- 
tered photon corresponding to “Stokes* 
lines; and the energy of the 

scattered photon corresponding to 
“anti-Stokes* lines 


Oj' == 0) (0 

correspond to excitation of the atom, since the atom ends up in a 
higher energy state. These lines are known as “Stokes’* lines (the 
levels are shifted towards the red part of the spectrum). The 
second case corresponds to “anti-Stokes” lines (shifting towards 
the violet part of the spectrum): 


lu" = (0 -j- > (O; 


these lines appear only whenthe light is scattered by excited atoms. 
It is obvious that at low temperatures only Stokes lines can be 
observed. As the temperature increases and the atoms of the 
substance begin to undergo transitions to excited states, anti- 
Stokes lines appear. 

The Raman effect provides much important information in 
studies of molecular structure. In Chapter 12, Section C. we saw 
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that the rotational and vibrational levels 
rotational levels), which provide data on 
all located in the far infrared region of 
the spectrum and are very difficult to 
observe* In studies of the Raman effect, 
it is possible to use visible light in 
determining molecular spectra, since 
these spectra are superposed on the 
lines in the spectrum of the incident 
light. The experimental values of <«' and 
o/' (see Fig. 14,6) yield the molecular 
frequencies 


(and also the vibrational- 
molecular structure, are 


b[ 


d 

1" 


□ 


(U 




w' 

iJ* 


1 1 1 1 


(o/fZ/j (0 — a/ and 


- 0 ), 


from which the selection rules can be 
derived. 


Fig. 14.6. Superposition of the 
molecular frequencies on the 
frequency of incident light: 
a) spectral line co in the ab- 
sence of molecular oscillations, 
and b) shift of the spectral line 
due to molecular oscillations 
CO— CO and CO* CO ^ 


\ Find the energy correction for a system in second-order perturbation 


(problem 

the(5fy7* 

Solution. Including the terms up to and including the second order in the expansions of 
the wave function (14.3) and the energy t (14.4), and substituting these expansions into 
the Schrodinger equation (14,2a), we obtain the equation 

(/•“ - H’) = - (£'„ - V'W^ - ElC 

Since the solution of the homogeneous equauon(£^ — is orthogonal to the 

right-hand side and since we can substitute the expression (14.22) for we have 


X' 


Ik’. 1“ 

j n n 


(14.85) 


The value of is given by (14,15), and we have used the relationship 

nn' '' n’iV 


which holds for Hermiti an operators. 

We note that the second-order correction (14,85) to the energy of the ground state is 
always negative, since all the other levels En' are higher than the level fj, that is, 

(l^oblem 14,y Using the results of the perturbation theory, find the energy of the 
anhdrrrifiTSnr^'gCTlTator including the terms up to the order of fi^\ take the Hamiltonian of 
the system to be equal to 






‘2//io 




where V* = ax^ -|- (the constants a and P are classical quantities). 
Solution, The energy of a harmonic oscillator (F’ = 0) is 


Taking V' as the perturbation energy, the first-order approximauon gives 

E'n=Vnn = Oi(x‘)„n + Hx')nn- 
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It can easily be shown that 


4.^ 

(X^)nn= J \^„l'x»dx = 0, 

— 00 


since the Integrand is an odd function. 

To calculate the matrix elements P (x*)nn, we use the multiplication rule for matrix 
elements (see Problem 10.4), obtaining 


(X%, = ((xX + ({xX_ „)» + ((xX. „ + 2 )>. 

k 

Substituting the value of {x^)nk from Problem 10.5, we obtain the following expression 
for the first approximation of the perturbation energy Eni 

Our problem, however, is not yet fully solved, since in the second-order approximation 

a“ 

there is a contribution proportional to ~ ^ which arises from the first term of 
the perturbation energy a-v'’, and we must take this contribution into account. The 

A'** 

second-order contribution from the term pA* is proportional to h\ and accordingly 

it may be neglected in our approximation. The second-order correction arising from 
the first term of the perturbation energy can be calculated from Eq. (14,85): 

... 0^ V 

no> (/I — n') ■ • 

n' 

The only nonvanishing matrix elements are [see Eq, (10.55) and Problem 10.5] 


r,-a. 1 A‘(«— >)(« — 'J) 

)n, /i-B — \ ^ )n^n a v^)n -a, n s — 1/ ' ^ ^ > 


n+3 “ ^ /i» 


where Xq — 


V 




Hence 


tn~ ^ , 4 n’* -4- rt + . 

4 mja>< V ' dO y 


(14.87) 
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The Klein-Gordon Scalar Relativistic 
Wave Equation 

'Relativistic mechanics and the klein-gordon 

EQUATION. RELATIVISTIC INVARIANCE 

The Schrbdinger wave equation Is nonrelativistic: it is suitable 
only for particles whose velocity o is much smaller than the 
velocity of light c. It^ not invar iant with respect to the Lorentz 
iransformations of tile special theory of relativity since'th^e ^ 
in asymmetry between^ the time and space "coordinates (the? 
squhtion contains a first derivative with respect to time, and 
second derivatives with respect to the space coordinates). Ac- 
cording to the special theory of relativity, it is necessary for the 
time and space coordinates to be treated on the same basis. 

It is interesting to note that the de Broglie relations 

p = £ = Au) (15.1) 

are relatively invariant. In the Lorentz transformation, they 
behave like a four-vector with components 


P*-?. (15.2) 

This indicates that it is possible to generalize quantum mechanics 
to the case of particles traveling with a velocity of the order of 
the velocity of light. 

A method of extending the nonrelativistic wave equation in a 
way consistent with the special theory of relativity was proposed 
Klein and Gordon in 1926. (This method was also put forward 
by Schrodlnger and by Fok.) The simplest way of obtaining the 
Klein-Gordon equation consists in taking the relativistic relation- 
ship between the energy £, momentiun p, and rest mass mg of a 
free particle 


— c V — m Jc* = 0 , 


(15.3) 
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substituting the energy and momentum operators 

E = ih~, 9 = — ihV, (15.4) 

which act on the wave function ^>(r, /). Replacing mo by Hk^slc dind 
dividing by we obtain^ 


\ dt‘^ 


(15.5) 

Xx=X, 

11 

ll 

II 

(15.6) 


(a double occurrence of the subscript p in a term indicates that it 
should be summed from 1 to 4), Since our initial equation was the 
relativistic relationship (15.3), Eq. (15.6) is relativistically in- 
variant and, therefore, it is symmetric with respect to the space 
and time coordinates. We shall not attempt to prove the invariance 
of the Klein-Gordon equation more rigorously, and shall now pro- 
ceed to examine its properties, 

B.y^HE CHARGE AND CURRENT DENSITY 

As in the nonrelativistic theory, equations for the charge and 
current density can be obtained on the basis of the equation of 
continuity 




'+ 4 ' = 0 - 


(15.7) 


We multiply (15.5) on the left by and the complex conjugate 
of this equation [that is, Eq. (15.5) with substituted for ] by 
<]> . Then we subtract the second of the resulting equations from 
the first 


— I,- (t* t — I- 4^ t") = 0- (15.8) 

After some simple transformations, Eq. (15.8) becomes 

V . {^ V - <1-* V 4 '1' - J- 1*} = 0- (15.9) 


Here and in the subsequent chapters, we shall not write as a function of t, as was 
done m the Schodingcr theory. In the wave equation for monochromatic (K - const) waves, 
for which only the time-independent part of the </» function must be considered, we shall 
use the energy eigenvalue instead of the operator. 
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(15.10) 




(15.11) 


note that these expressions satisfy the equation of continuity 
.7). Moreover, they define a four-dimensional vector 






(15.12) 


ere 


Xr^ = ict, ]\z=zicp. 


(15,13) 


e current density (15.11) is identical with the nonrelativistic 
}ression (5.21), and the charge density (15.10) reduces to the 

irelativistic expression (5.20) when Substituting ih E 

e (15,4)] into (15.10), we obtain 


P 


j!L 

nioc^ 




(15.14) 


ich becomes the usual expression p = in the nonrelativistic 
)roximation Thus we have selected a normalization in 

ich the relativistic values of p and j reduce to the corresponding 

irelativistic expressions when ( " ^^<1- 

It is worth noting that the definition of the particle density 
distinguished from the charge density) 


p ih 

~ e 2Woc“ 




(15.15) 


es rise to some difficulties in the relativistic theory. The 
jin-Gordon wave equation is a second-order differential equation 

i, therefore, both and can be arbitrarily defined at some 

en time t. Consequently, the density pn (15.15) is not positively 
ined, unlike the nonrelativistic probability density 


Po = 'l'*t- 


(15.16) 
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Accordingly, the expression (15«15) cannot be Interpreted as the 
particle density (that is, the “number of particles per unit 
volume’’). The underlying reason for this is that the same rela- 
tivistic equation describes particles with either positive or nega- 
tive charge (and, indeed, tc mesons, to which the Klein-Gordon 
equation is applicable, may be either positive or negative in charge). 
The quantity po , therefore, can have both signs. 

C. RELATIVISTIC THEORY OF THE HYDROGEN ATOM 
(NEGLECTING THE ELECTRON SPIN) 

To treat the interaction of a particle with an electromagnetic 
field (defined as usual by a vector potential A and a scalar po- 
tential ti>), we introduce the same operators as in the nonrela- 
tivistic case 


E = p — — m — (15.17) 

From (15.3), we can obtain the Klein-Gordon equation^ 

{(«ft ± - e®)’ - c’ ( jftV + -f i4)’ - my] <1. = 0. (15. 19) 

We shall use this equation to study the spectrum of the hydrogen- 

Ze^ 

like atom. Setting /1=0. and = y~ in (15.19), we have 

V’t+-^{(E-F)’-m;c*}t = 0. (15.20) 

Since the potential energy does not depend on time in this equation, 
we can transform to the time-independent case by means of the 
substitution 


In this equation, we have not included the rest mass energy my of 
the particle in the energy E. As a result, Eq. (15.20) becomes 

[(^ + 'f = (15.21) 


^This equation can also be obtained from the relativistic Hamiltonian for a particle in 
an electromagnetic field 


H = t cl) . (IS 18) 

It IS only necessary to transfer to the left-hand side, square both sides, and replace 
p and A by their quantum-mechanical operators. 
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Just as in the Schrodinger theory, we shall look for a solution to 
this equation in the form 

4.=:/?(r)yr(e, (p). (15.22) 

The equation for the radial part is 

+ (16.23) 


Here = is a dimensionless quantity, called the fine 

structure constant, and 




(15.24) 


When c^-^oo, the expressions (15.24) reduce exactly to their non- 
relativistic counterparts (see Chapter 13). 

The somewhat improved values that we have obtained for A 
and B by taking into account the relativistic effect do not change 
in any way the general character of the solution that was obtained 

in nonrelativistic theory. Formally, the additional term -pr- in 

Eq. (15.23) can be treated as a relativistic, attractive potential 
energy, which obeys an inverse square law and which affects the 
solution under certain conditions. A detailed analysis of the role 
of this term will be given below. 

First of all, let us consider the asymptotic solution Rq as 
In this case, Eq. (15.23) reduces to 


_L /?, = 0. (15.25) 

We shall look for a solution of this equation in the form 

R,=Cr^. 

We then obtain an equation for s 


s(s+ !)-/(/+ 1) + ZV = 0, 

(15.26) 

the solution of which is 



(15.27) 

Consequently, 


/?o 

(15.28) 
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If 


z«<4. 

both roots Sy and s, will be real for all values of / (/ = 0, 1, 2, . . .)• 
We retain only the solution for rR^ that does not diverge at r = 0; 
that is, we set Similarly, only the exponentially decreasing 

solution for the wave function as r — oo should be kept when 
£'<;0 (/I >0). The asymptotically decreasing solutions for the two 
limiting values of r yield the same equation for the energy spectrum 
as in the Schrbdinger theory, as can be seen from Eq. (13.20) by 
substituting Sy for /. Thus, for the eigenvalues we have the equation 

Y-A=^ + -2 (15.29) 

Substituting the relativistic values (15.24) of the constants A and 
B, we have (for n=k + l + l) 


■ niec^ 


H- 






(15.30) 


Expanding this expression into a series of powers of and 
retaining only the first two nonvanishing terms, we obtain an 
energy spectrum which Includes relativistic effects: 






/ 

(/+ 



(15.31) 


The first term is identical to the nonrelativistic expression. The 
second term, which is proportional to the square of the fine 
structure constant a 1/137, gives the relativistic correction. 

The relativistic correction for the hydrogen atom (2=1) is 
interesting because it removes the degeneracy with regard to L 
The level for a given n is split into n closely spaced sublevels 
(the close spacing is a consequence of the smallness of a^) because 
the orbital quantum number I can assume n different values 
{/ = 0, 1, 2, . . . , n — 1 ). In order to compare these results with 
experiment, let us compute the distance between the doublet states 
of the Balmer series {n = 2). We find 


A(o = 


Zfsi — E20 


8 

3 16 ‘ 


(15.32) 


Experimental data show that the actual distance between doublet 
states in the Balmer series is only one third of the distance 
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given by Eq* (15.32). This discrepancy arises because the fine 
structure of the hydrogen levels cannot be explained entirely in 
terms of the relativistic relationship between mass and velocity. 
As we shall see in Chapter 19, it is also necessary to consider the 
electron spin (that is, the intrinsic angular momentum of the 
electron, which gives rise to a magnetic moment). At first it 
was thought that the Klein-Gordon equation could be used to describe 
a relativistic electron. As a result of the discrepancies between 
its predictions and experiment, however, it was established that 
it describes particles with spin of 0, whereas the electron has a 
spin of 1/2. Consequently, the Klein-Gordon equation can be used 
for 7u mesons, which have a spin of 0. 

Finally, let us consider the case in which 


Za>-^ (15.33) 

in Eq. (15.27). In this case, the solution does not consist of a 
correction added to the nonrelativistic solution, but is fundamentally 
new. Indeed, for / = 0 both roots Si and are complex. Therefore, 
the asymptotic solution (15.28) is 

= + (15.34) 


where ZV---. We cannot Impose the condition C 2 = 0 (or 

Ci = 0) on our problem because both solutions have the same 
singularity as r-^0. Since the solution remains unrestricted by a 
potential barrier at small r, even when E<^0,the energy spectrum 
for 1 = 0 is continuous. Consequently, the particle will “falF^ to 
the center. 

The question of the stability of the motion of the particle is very 
important in studies of the central forces. The above results can be 
used to analyze the solution of the Schrbdinger equation in the 
general case of an attractive potential 


V 


2mo r'? * 


(15.35) 


On obtaining the asymptotic solution for r-^0 [15.25], we see that 
the solution will vanish at the origin only for a maximum value of 
<7 equal to 2 and that the particle will not fall to the center if 



It is interesting to note in this connection that the r® dependence 
of potential energy occurs fairly frequently in the theory of 
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elementary particles since the potential energy of this form charac- 
terizes the interaction oftwo elementary magnetic dipoles. Actually, 
two cases have to be distinguished. In the first case, only at 

relatively large distances, while at small distances V varies as 
This behavior of V is observed for the spin-orbit interaction 
in the Dirac theory and it does not give rise to any difficulties. 
Moreover, it is found that stable motion corresponds to a value of 
Z greater than any in the present periodic system of elements 
(Z 137) because the spin effects reduce the influence of the 
relativistic effects (whereas in the Klein-Gordon theory Z is con- 
fined to relatively small values 2 < y 137). 

In the second case, V continues to vary as r"* even at small 
distances (r-vO), and particles cannot be combined into atomlike 
systems. This case can be observed in the meson theory of 
nuclear forces, where quasi-magnetic interactions are of con- 
siderable importance. The formation of an atomlike system be- 
comes possible only if the potential is cut off at small distances 
from the origin. 


_ Problem 15. l.\ Find an expression for p and / If the scalar relativistic equation con- 
talrRmSl 111 '■WSTfig from the presence of an electromagnetic field. 


p — Ay E — in the Klein-Gordon equauon. 


Solution, Let us substitute p 
which now describes the motion of a particle in an electromagneuc field. 


Repeating the calculations that lead to Eq, (15,15), we obtain the generalization 




dx„ dx» ^ 


■ flc 




where the four-dimensional potential has the components 

/4>}. 

(^Problem 1 5,2^ Show that in the case of time-independent potentials A and the 
spacS^'iiiuiii UHiiJ TOordinates in the Klein-Gordon equauon can be separated and the wave 
function can be written as 


^(r, t) = ^(r)e ** . 

Find the wave function of a free particle described by the Klein- 
Gordon equation using for normalization the expression for the density p . Show that in 
the relativistic case, p is the charge density rather than the particle density. 

Solution. Suppose the momentum of the particle is directed along the z axis and 
that the partite travels in a segment of length L (the one-dimensional case). The 
solution of the Klein-Gordon equauon can be written in the form 





1 

YT 


{J[g-UKt + az QgicKt-IH^^ 
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where E=±. cflK is the energy of the particle. Since the charge density is given by the 
expression 


2moC*F dt df 


the total charge is [see (15,14)] 


1/2 

-i/2 


It follows that p is the charge density rather than the particle density, since this re- 
lation can be Interpreted only if it is assumed that particles described by an amplitude 
B (negative energies) have a charge of opposite sign from particles described by an 
amplitude A {E> 0). 
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Motion of on Electron in a Magnetic Field. 
Electron Spin 


In 1896, Zeeman found that when atoms are placed in a strong 
magnetic field, their spectral lines are split into several compo- 
nents. This phenomenon is known as the Zeeman effect. The 
Zeeman effect played an important role in the investigation of the 
structure and magnetic properties of the atom. It led in particular 
to the discovery of the spin (intrinsic mechanical moment) and 
magnetic moment of the electron. Accordingly, it is worth elaborat- 
ing the theory of this effect in some detail. 


A. THE CLASSICAL THEORY OF THE ZEEMAN EFFECT 

The simplest model of the radiating atom in Lorentz^ selection 
theory is based on the assumption that the electron moves under 
the influence of an elastic force 

F=^kr. (16.1) 

The elastic constant is related to the electron mass and the 

angular frequency of oscillation by the expression 

k==^nk(f)l. (16.2) 

The equation for the oscillations of an electron in a homogeneous, 
constant magnetic field //, therefore, becomes 

m^r + mQ%r = — r x (16.3) 

where e== — is the electron charge. Taking the components of 
(16.3) along the coordinate axes (the z axis is chosen in the direc- 
tion of the field //, so that H^^=Hy = 0, we find 

“0, 5 + <«ol/ — = 0, 




(16.4) 



MOTION OF AN ELECTRON IN A MAGNETIC FIELD 


269 


Multiplying the second equation by i (i— V-1 ) and adding it to the 
first, we obtain 


e + coj? — 2M = 0 


(16.5) 


where 0 = ^^ is the Larmor frequency of precession and %=x-\-{y. 


2moC 

For the solution of (16.5) is of the form^ 

and it follows from (16.4) that the z component is 

2 = 


(16.6) 


(16.7) 


From the above expressions it can be seen that the frequency of 
oscillation of the electron (a three-dimensional oscillator) changes 
under the influence of a magnetic field. An atom placed in a 
magnetic field should emit radiation at three frequencies: 

0)0 — 0, O)o, (Do-f-o. (16.8) 

According to the classical theory, an oscillator does not emit 
energy in the direction of oscillation. Therefore, when we observe 
the light emitted by an atom in the z direction (the direction of the 
magnetic lines of force), we are able to detect only two lines (there 
will be no component due to oscillations along the z axis). In 
other directions, we are able to observe all three components (the 
normal Zeeman effect). Equations (16.6) and (16,7) indicate that 
the oscillations are resolved in a longitudinal component in the 
direction of the magnetic field (the z axis) and two transverse 
components corresponding to two directions of rotation (a right- 
handed rotation and a left-handed rotation). Thus, the magnetic 
field has no effect on the longitudinal oscillations and acts only 
on the circular rotations in the plane perpendicular to the magnetic 
field. 

In quantum theory a change in the frequency of oscillation is 
always associated with a change of energy. At first glance, it may 
seem strange that the magnetic field changes the energy of the 

electron, since the Lorentz force F= ^ vxHis perpendicular to 

the velocity, and therefore the work done by this force, just as the 
work done by any centripetal force, must be equal to zero. On the 


^Even in the case of very strong fields 10^ gauss) the quantity o is of the 

order of sec whereas the fre<^uency of oscillation of an electron in an atom (the 

optical spectrum) is (oq 10^^ sec~ Therefore, the inequality o « wq is practically 
always satisfied 
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other hand, it Is well known that an electron rotating in a circle 
(that is, a current loop) forms a dipole. The energy of this dipole 
in a magnetic field is equal to 

( 16 . 9 ) 

These two conflicting conclusions may be explained in the following 
manner. As the magnetic field chaises from zero to a certain 
constant value = the electron experiences a force directed 
along one of the components of the electric field g. This force 
imparts an additional energy to the electron. The magnitude of 
this component of the electric field can be found from Faraday’s 
law of induction (second Maxwell equation): 

Assuming thato^T and g depend only on time and that the switching 
on of the magnetic field does not alter the radius of the stationary 
orbit, we find 


The additional velocity imparted to the electron (e= —eo) by this 
electric field can be found from the equation 

^ yiTiag;^ ^0 ^ 

dt mo ‘ 2 moC dt • 


which gives 

InioC 

As we can see, is independent of the rate of change of the 

magnetic field when it is switched on. Since the magnetic field is 
directed along the z axis and the induced electric field (and there- 
fore also is perpendicular to it and to the radius of the orbit, 

we may write in vector form 

From this it is clear that the magnetic field produces an additional 
rotation of the electron (Larmor precession) with an angular 

velocity 0= 5^^^, 

We may now determine the unknown additional energy acquired 
by a rotating electron when the magnetic field is switched on. Since 
the energy of an electron placed in a magnetic field is determined 
entirely by its kinetic energy, we have 



MOTION OF AN ELECTRON IN A MAGNETIC FIELD 


271 


Y mag — ^ 


where v is the velocity of the electron before the magnetic field is 
switched on. Retaining only terms that are linear with respect 
to we obtain 

,.) = ^//.(r xzi). (16.10) 

Comparing (16.10) with (16.9), we see that the magnetic moment 
of an electron moving in a circle is given by the expression 

= rxii. (16.11) 

Recalling that its angular momentum is equal to 


L = m^ r xVy 


we find a simple relation between these two quantities 





(16.12) 


It is worth noting that the magnitude of the magnetic moment can 
also be found from other considerations. As we know, the magnetic 
moment of a current loop is 





where /i® is a unit vector normal to the plane of the current loop. 
In the above relationship the current strength is equal to 




2nr’ 


where T = is the period of rotation, while the area enclosed 
by the current is 

Combining these last relations we again obtain the expression (16.11) 
for the quantity ^ . 

B. THE ZEEMAN EFFECT IN THE NONRELATIVISTIC 
SCHRODINGER THEORY 

In order to obtain the Schrodinger equation for an electron 
moving in a magnetic field, we shall use the general rule for trans- 
formation of the classical Hamiltonian to the quantum case (see, for 
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example. Chapters). To do this, we substitute the momentum oper- 
ator p into the classical expression for the energy of an electron in 
the presence of electrostatic and magnetic fields. The Schrodinger 
equation for the central forces in the presence of a magnetic field 
characterized by the vector potential A then takes the form 

(£ — = 0 ( 16 . 13 ) 

where the Hamiltonian of the Schrodinger equation is 

HS ( 16 . 14 ) 

^ ‘ 2//io 


and the operator P = p— ^ /I is called the generalized momentum 

operator. For the case of a constant, homogeneous magnetic field 
directed along the z axis (H^ = Hy = 0, = we may write 


Using the fact that 




where 


and neglecting the terms proportional to the square of the magnetic 
field strength we find 


A .p5<j,^0. 

Remembering that 


nioc 


A • p 


'Znhc 


L.. 


( 16 . 15 ) 


( 16 . 16 ) 


where 



we reduce the Schrodinger equation to the form 


a6.i7) 
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It is easy to show that this equation is satisfied by the usual 
wave function for a centrally symmetric field: 

i. = /?(r)Kr(&. <f). (16.18) 

Substituting this solution into (16.17) and recalling that 

L^yr=ntnYr. 

we obtain the equation 

which also includes the effect of the magnetic field on the atom. 
This equation may be written as 

(£-HS )(. = 0. 


where we take the charge of the electron as 

e = — eo. 

The last term in the Hamiltonian may be attributed to the presence 
of the orbital magnetic moment of the atom, which gives rise to an 
additional energy 

(16.20) 

Therefore, the orbital magnetic moment obtained on the basis of 
the Schrddinger theory is 


2moC 


m. 


(16.21) 


Recalling the expression for the 2 component of the angular momen- 
tum 


tjg — htUy 


we obtain the same relation between the magnetic moment and the 
angular momentum as in the classical theory [see (16.12)]: 


Lg 2moC * 


(16.22) 


It follows, therefore, that the components of the orbital magnetic 
moment are multiples of a certain unit magnetic moment 
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= erg.gauss-‘, (16.23) 

which Is called the Bohr magneton. 

The orbital magnetic moment of an electron is one of the most 
important magnetic properties of an atom. It can be seen from 
Eq, (16.20) that the additional energy of an orbital electron placed 
in a magnetic field is given by the expression 


(16.24) 

IfTloC 

since is a constant, where o is the Larmor frequency. Because 
of the selection rules for the magnetic quantum number {^m = 0, 1), 

the additional radiation frequencies due to the Zeeman splitting 
are the same as in the classical theory [see (16.8)], namely, 

Au)=:— ^-- = 0 Am = 0, “*“0. (16,25) 

The normal Zeeman splitting ^ of the spectral lines (triplets 
and doublets) is encountered only in the case of a strong field (the 
Paschen-Back effect) or in the case when the total spin of the 
electrons in the atom is equal to zero (for example, in parahelium, 
whose outer shell consists of two electrons with oppositely directed 
spins). In cases in which the spectral lines are split into more than 
three components, the Zeeman effect is said to be anomalous. 
The anomalous Zeeman effect is connected with the spin properties 
of electrons, and an explanation of this effect can be constructed 
only on the basis of Dirac ^s theory, which takes into account the 
spin effects (see Chapter 20), 


C. THE EXPERIMENTAL DISCOVERY OF ELECTRON SPIN 

It was shown in the last section that the Schro dingar theory is able 
to explaiq^ gnly the orbital angular momentum and magnetic moffifent 
ofjin electron. The basic equations characterizing these properties 
are Eq. (16.22) for the ratio of the orbital magnetic moment and 


2 

Let us note that the reasons for the use of the terms “normal Zeeman effect” and 
“anomalous Zeeman effect” are purely historical Before the discovery of electron spin, 
only the classical theory of triplet splitting (normal Zeeman effect) was known When a 
more complicated splitting was discovered it was called the anomalous Zeeman effect 
because no theoretical explanation could be given for it until the development of the theory 
of electron spin 
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the orbital angular momentum and Eq. (16.24), which indicates that 
the number of possible orientations of the magnetic moment relative 
to the z axis is necessarily odd, since the 
number of states with different quantum 
numbers m is equal to 2/ + 1. TheSchrbdinger 
theory, however, does not adequately account 
for all the experimental data, the analysis of 
which led to the discovery of the spin prop- 
erties of electron. Let us briefly discuss 
these e5q>erimental results. 

1. First of all let us consider the Einstein- 
de Haas experiment (1915), which was carried 
out in order to verify Eq. (16.22): 



Fig 16 I. Diagram of the 
Einstein-do Haas experi- 
ment for the determination 
of the Lande g factor. 
1) quartz fiber, 2) cur- 
rent carrying coil, 3) fer- 
ro-magnetic rod. 


^ 2moC » 

where g , the Lande factor, should be equal to 
unity for orbital moments. In this experi- 
ment, a ferromagnetic rod is suspended on 
a quartz fiber and magnetized by passing a 
current through a coll (see Fig. 16.1). As 
a result, the rod acquires a magnetic moment 
and an angular momentum whose magnitude 
can be determined from the angular rotation 
of the quartz fiber. If an alternating current is passed through the 
coil, an alternating torque will arise, causing torsional vibrations 
in the ferromagnetic sample. In addition, resonance can be used 
to enhance the rotational effect. Experimental measurements of 
the gyromagnetic ratio (16,22) show that the sign of this ratio is 
negative, so that it can be definitely concluded that the magnetiza- 
tion of the ferromagnetic sample is due to the motion of electrons. 
The value of the Lande g factor, however, turned out to be equal 
to two (^ = 2), rather than to the unity that was required by the 
classical or Schrbdinger theories. This g value was not explained 
until the development of the theory of electron spin (see below). 



Fig. 16.2. Diagram of the Stern-Gerlach experiment for the 
determination of the magnetic moment of monovalent atoms 
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2. Stern and Gerlach (1921) studied the behavior of a beam of 
atoms in an inhomogeneous magnetic field in order to check the 
theoretical result (16.23) 


1 ^^ = — 

which describes the spatial quantization. In their classical experi- 
ments a beam of monovalent atoms (hydrogen^ lithium, silver), 
traveling along the x axis, crossed a magnetic field directed along 
the 2 axis {Hj, = Hy = 0, = This magnetic field was very 
inhomogeneous, so that it had large gradients. Then a magnetic 
dipole of moment 

fl = ^magA (16.26) 

where ^mag is the “magnetic charge’’ and / is the length of the 
dipole, will e;Q)erience a force directed along the z axis^ 

i-'z = fimag (Z) — a5r (z — / COS a)} = 

= emag<COSa--^-- = Ii^-^- = — (16.27) 


Let us calculate in a simplified fashion the displacement ex- 
perienced by a particle under the action of the force h\ during the 
time t. If the particle moves with a velocity perpendicular to 
the magnetic field (that is, to the z axis) and travels a distance 
L = vt^ the displacement along the direction of the z axis will 
equal 


SZ 


2 M' dz'' 


(16.28) 


/T 

In this case the acceleration is a!; = where the force is given 

by (16.27), and M is the mass of the atom. Consequently, a beam of 
particles possessing a magnetic moment p will be split into 
components as it passes through an Inhomogeneous magnetic field. 
The number of components is determined by the possible number 
of projections of the magnetic moment on the direction of the 
field. 

In their experiments. Stern and Gerlach studied the splitting 
of a beam of atoms in the s state. In this state, the angular 
momentum and consequently the magnetic moment of an atom are 
equal to zero (/ = m = 0), and therefore there shouldbe no splitting. 
If the electron is in the p state (^=1), then triple splitting should 


3 

We note that in order to determine the motion of the center of mass of the magnetic 
dipole, it IS quite immaterial whether we regard it as a rigid dipole or a current loop 
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be observed because of the three possible values of the magnetic 
quantum number 


8z = 0(m = 0). = (m = ±l). 

Experiments on hydrogen, lithium, silver, and other atoms show, 
however, that the beam is split into only two components. This 
proved the existence of a magnetic moment for atoms in the 
s state. The projection of this magnetic moment on the z axis can 
assume only two values. The measurements of the quantity fx 
showed that it is equal to one Bohr magneton 


= (16.29) 

In order to reconcile the results of these two classical experi- 
ments, Uhlenbeck and Goudsmit introduced the hypothesis that 
an electron posses an intrinsic angular momentum in addition 
to its orbital angular momentum. At first it was believed that 
this intrinsic angular momentum could be treated by analogy 
with a top spinning about an axis, and therefore it was called the 
electron spin. It must, however, be emphasized that no rigorous 
classical theory of spin exists. According to the hypothesis of 
Uhlenbeck and Goudsmit, the intrinsic angular momentum of an 
electron is equal to 


s, = ±1a; (16.30) 

that is, the quantum number characterizing its projection on the 

z axis takes on half- integral values zb . The important 

distinction between the integral (orbital magnetic quantum number 

m) and the half- integral (spin quantum number ) values of 

quantum numbers lies in the number of possible states. Integral 

quantum numbers always give us an odd number of states (for 

/=:0 we have one state /^ = 0; for l = \ there are three states 

m==0, +1, —1; and so on). On the other hand, half- integral 

quantum numbers give us an even number of states (for example, 

1 113 

for s==^ there are two states ^7^^ =+^, — ^ there 

are four states; and so on). 

The assumption of the half-integral quantum numbers was 
introduced even before Uhlenbeck and Goudsmit in order to explain 
the double splitting of terms for the monovalent atoms. The Stern- 
Gerlach experiment showed that there are two possible electron 
states in a monovalent atom; that is, the electron spin must be 
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described by the half- integral quantum numbers corresponding to 
two opposite orientations. Recalling that the Einstein-de Haas 
experiment showed that the Lande g factor in Eq. (16.29) is equal 
to two (^==2) and the intrinsic angular momentum is given by 
Eq. (16.30), we find the following expression for the z component 
of an intrinsic magnetic moment: 


niQC^^ -i-I^Of (16.31) 

which is simply one Bohr magneton. The introduction of the 
electron spin also made it possible to explain the multiple splitting 
of the spectral lines of atoms, as well as their magnetic properties. 


D. PAULI EQUATION 

A nonrelativlstic wave equation that includes the intrinsic 
magnetic moment of the electron was first proposed by Pauli. For 
this purpose he took the ordinary Hamiltonian of the Schrbdinger 
equation and added to it a term representing the interaction between 
the magnetic moment of the electron M and the external magnetic 
field H : 


FP = — p • //. (16.32) 

Then the time- independent Schrddinger equation takes the form 

{^-HS -I ^ (16.33) 

where the Hamiltonian is 

Next, it was necessary to find suitable quantities to describe 
the intrinsic magnetic moment of the electron. It is well known 
that introduction of the spin is related to introduction of a 
fourth quantum number, characterizing the internal properties of 
an electron. On the other hand, the wave function ^ of a particle 
depends only on three quantum numbers, corresponding toquantiza- 
tion of the three spatial coordinates. In or der to describe spin, 
Pauli Introduced two wave functions Wy and UJ'Jlh 'place of the single 
wV^fuhction ^ , One of the wave functions describes a state With 
one spin orientation and the other wave function describes a state 
with the opposite spin orientation. The actual wave equation 
represents a system of two equations. It is possible to represent 
a system of two or more equations, such as 
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— (16,35) 
by a single equation in matrix notation 

where the multiplication is carried out according to the rule for 
the multiplication of matrices (c) = (i 3 !)(^): namely, an element of a 
matrix product is obtained by multiplying each element in the 
appropriate row of the first matrix by the corresponding elements 
of the appropriate column of the second matrix and taking the sum 
of these products, that is. 


(16.36) 

n 

Pauli suggested selecting the wave function 'F in the form of a one- 
column matrix W and setting the intrinsic magnetic moment 

of an electron equal to 

111 = — jioO', (16.37) 

where [Aq is the Bohr magneton and & stands for the three 2x2 
Pauli matrices 





(16.38) 


These matrices are denoted by the letter a with a prime (the same 
letter without a prime will be used to denote the 4x4 Dirac 
matrices). These matrices characterize the components of the 
spin vector along the coordinate axes. 

Using the rule (16.36) for matrix multiplication, it can be 
readily shown that the square of each Pauli matrix is equal to 
unity 


(16,39) 

where V denotes a 2 x 2 unit matrix^Q . It can also be shown 
that different matrices anticommute with one another: 

o;a;= — o>; = fa;, 

<3 0 ~~ — a a —r~ to 
.1 j 


(16.40) 
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In terms of the above matrix expressions, the nonrelativistic 
Pauli equation has the form 


+ (o = (16.41) 

This matrix equation is equivalent to a system of two ordinary 
equations, each of which corresponds to one of the rows of the 
matrix 


(£ _ 11 S _ nr, _ (/y^ _ iH,) = 0. 

(£ - H S .) V, - (// , + iHy) 'r. 0. ^ ‘ > 

Let us consider the case of an electron moving in a magnetic field 
directed along the z axis — H^—O, //,=_ Jf ). Using the Hamil- 
tonian (16.19), which includes the effect of a magnetic field, we 
obtain two equations of motion for the electron 


I IT -f e„<I' — ’w — 'I'j = 0, 

I nM (16.43) 

j £ + e„<l) - - [.0-^ m W, ^ 0, 

where in is the energy of interaction between the magnetic 
field G>^'and the orbital magnetic moment, and is the energy 

of interaction between the magnetic field and the spin magnetic 
moment. In the s state the magnetic quantum number m is equal 
to zero, so that the Pauli equation takes the form 

that is, the wave function '1^ describes a state in which the Intrinsic 
magnetic moment of the electron is parallel to the z axis, and the 
wave function a state in which the magnetic moment is anti- 
parallel to the z axis. These are the orientations of the intrinsic 
magnetic moment which were observed in the Stern-Gerlach 
experiment. 

As the function Pauli suggested taking the Hermitian adjoint 
of that is, the matrix whose elements are ob- 

tained by taking the complex conjugates of the elements of W 
and transposing them (interchanging rows and columns). Thus, if 


^ 0 . 
= 0 ; 


(16.44) 
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is a column matrix, will be a row matrix. The probability 
density will be given by 

(16.45) 

which includes the possibility of two spin orientations. 

The other matrix elements are formed in a similar manner. 
For example. 




~w 


(16.46) 


that is, and represent the probability densities of states 
in which the electron has a spin orientation parallel and antiparallel 
to the z axis, respectively. Using the expression for the intrinsic 
magnetic moment in the Pauli theory 




eoti 

2moC 




and the Einstein-de Haas relation between the intrinsic magnetic 
moment and the angular momentum 


we find that 



(16.47) 


Thus, in agreement with the other experimental facts, the z com- 
ponent of the spin angular momentum is equal to iL \/_, . 

Since the spin operator is expressed in terms of the matrices 
a', the spin components do not commute. In this they resemble the 
components of the orbital angular momentum, which are operators 
depending on derivatives [see (11.75) and (11.76)]. The commutation 
relations satisfied by the spin operators can be easily established 
from (16.40) and (16.47): 


SyS, — S,Sy = i/zS,, 
— S.^S, = ihSy. 


(16.48) 



282 


RELATIVISTIC QUANTUM MECHANICS 


Concluding our discussion, we note that the Klein-Gordon theory, 
which includes the relativistic effects but neglects the spin effects, 
and the Pauli theory, which, on the contrary, neglects the relativistic 
effects but includes the spin effects, were predecessors of the more 
rigorous theory of the electron developed by Dirac, which predicts 
all the elementary properties of the electronTJiIt should be noted 
in this connection that the absolute value of fHe intrinsic magnetic 
moment was introduced in the Pauli theory from purely empirical 
considerations. 




4^ 

Zee 


rShow that in nonrelauvistic quantum mechanics, just as in the classical 
theory, the ^ee^man effect is due to the precessional motion of the orbit in a magnetic 
field, the motion having the Larmor frequency. 

Solution, From the Hamiltonian of the Schrodinger equation for the case of an electron 
moving in a magnetic field directed along the z axis 


H: 


-I 


we can find the time derivatives of the angular momentum 
i 


'dt 




T 


dt 


Iv-0, 


where o is the Larmor frequency. It follows that the component of the angular momentum 
in the field direction {z axis) is a constant of the motion. The components along the x 
and y axes, however, precess around the z axis with the frequency o , 


S^oblemlbi^ Show that the spin operator S is vectorial; that is, if we construct the 
1 inear coriiBination 


-f- YbS^i 

where {3, y are the directional cosines, then 

and so on. 


roblem 16 .33 Show that in a homogeneous magnetic field which is a function of ume 
only, lllllf W4ve mliction of the Pauli equation can be resolved into a product of coordinates 
and spin functions. What form does this solution take if the field is time independent? 
Solution, Let us look for a solution of the Pauli equation in the form 




It is readily shown that the coordinate part of the wave function (r, t) satisfies the 
ordinary Schrodinger equation without the spin 


= t), 


while the spin part of the wave function may be obtained from the equation 



MOTION OF AN ELECTRON IN A MAGNETIC FIELD 


203 


The spin part of the wave function is normalized as follows 
(CfQ) (g) = CfCt + C}C, = 1. 

In the case of a stationary magnetic field it is easy to determine the time-dependent 
part in the above equations. We simply set 



+ (r, = f " i;(r). 


Then the Ume-lndependent parts of the wave function are determined from 


= t. 



Find the eigenvalues of the operator of a spin component along the 
direciiuii SpSCUIfifl by the spherical angles d and «p. Investigate the particular cases In 
which this direction is the A', y or z axis. 

Solution, Consider first the case in which the spin is directed along the z axis. Then 
the imtial equation takes the form 


where 



This matrix equation is equivalent to two homogeneous algebraic equations 


lc,-AC,=0, 

-g- Cl -|- ACa = 0. 


ITie normalized solutions of these equations have the form 


X 



and X r= 



The first evidently corresponds to the case in which the spin is directed along the z 
axis; the second to the case in which the spin is directed along the -z axis. 

The operator for the component of spin along the direction defined by the si^erical 
angles 0 and cp with respect to the coordinate axes is equal to 


S = sm ft cos S;p + sin ft sin y + cos ft S^, 


where 


2 \1 0,1- 2 [i Oj’ 2[0 - ij- 
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Hence from the equation 


S 




we find two solutions: 

(a) the soluuon corresponding to the case in which the spin is parallel to this direction 




cos - 


1 \ 


i) » 

\ 2"'7 


(b) the solution corresponding to the case In which the spin Is antiparallel to this 
direction 

/ \ 

“ ’ tos j ‘ 

Setting = 0, cp = 0 we obtain the same soluuon as above, llie cases in which the spin is 
directed along the x or y axes may be obtained by setting, respectively. 


» = — .¥ = 0 or »=^-, ¥ = -. 


VjProblem 16,5, j The electron spin Is parallel to the ^ axis. Find the probability that 
the 6'Wiionent of the spin (a) In a direction parallel to the i axis, and (b) in a direction 

making an angle D with the z axis, will have the values -ly h and — ^ li. 

Hint, Take the wave function describing the state in which the spin is parallel to the 
z axis and then expand it in terms of the functions corresponding to the cases In which 
the spin is parallel and antiparallel to the direction forming an angle with the z axis. 
Both these functions are given in Problem 16,4, Without loss of generality we may set 
the angle cp = 0, 

llien the squared modulus of the expansion coefficients gives the probabiliues 
“'i/a = ~ , w_,,^ = sin- ^ 


of the components of the spin along the corresponding direcuons; these are equal to 



In order to find the r component of the spin, we must set d = in the last equations. 
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The Dirac Wave Equation^ 


A. LINEARIZATION OF THE ENERGY OPERATOR. 
DIRAC MATRICES AND THEIR RELATION TO 
PAULI MATRICES 


As indicated in Section 15, relativistic quantum mechanics is 
based on the well-known relativistic relation between the energy E, 
momentum p, and rest mass nia: 

E=cVp^-\-mlc\ (17.1) 

To obtain the wave equation describing a free particle, we substitute 
the appropriate operators into this equation 

E = p== — /ftv. (17.2) 


and act with these operators on the wave function. It is impossible, 
however, to make a direct transition to operators in (17.1) because 
we^ nflnnnt thp _a^pn_of_th^diiferentialjpj)gr_^^ 

the radicals^rjt is therefore necessary to get rid of the square 
root~in^X^Trry7^0ne^ way of doing this is to take the square of Eq. 
(17.1). This gives the relativistic Klein-Gordon wave equation with 
a one-component wave function.^ As already noted, this equation 
describes the motion of spinless particles and is not applicable to 
electrons, whose spin is equal to 1/2 (in units of ft). 

A different method of obtaining a linear relativistic wave 
equation was adopted by Dirac (1928). This method gave a first- 
order wave equation and consisted in linearizing the relation 
(17.1). It led to the discovery of the relativistic wave equation for 
the electron. This equation plays a fundamental role in relativistic 
quantum mechanics and quantum field theory since it provides a 


^ See PAM Dirac, Principles of Quantum Mechantvi, New York Oxford University 


Press, 1958 
2 


More exactly, we in fact have a function with two components and \p 2 - 

cot 


since a second derivative with respect to time appears in the fundamental equation One 
degree of freedom corresponds to particles with positive energy, the other to particles with 
negative energy It was shown by Pauli and Weisskopf that the negative energy states can 
be eliminated by carrying out a second quantization of the scalar equation and introducing 
spinless particles with charges of opposite signs 
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suitable description of the motion of particles of spin 1/2. The 
discovery of this equation was the most important advance in the 
theory of the electron since the Maxwell- Lorentz equations of 
classical electrodynamics, Bohr^s semiclassical theory and non- 
relativistic quantum mechanics servedonly as transitional theories. 
The relativis tic relation between en er gy and monie ntum is 

1 inearpESl^ J/ext^tin^M^ poly- 

noinial with .the aid of m^ices. For this purpose we represent 
( 1 fTlfTrTTKe" f or m^ 

3 

E = c = c (17.3) 

IX -rO 

where 

Pu = mor, Px = Px. Pi^Py^ Pi = Pz‘ (17.4) 

We note that 

3 

E‘ = c^ 2 Pf.Pf. = c’‘{p* + mlc-). (17.5) 

,i-0 

To determine what conditions the quantities must satisfy, we 
square both sides of (17.3). Then, if the operators p^, and Pjx' com- 
mute, we have;J|||^ 

E‘=c‘ == Y + (17.6) 

\L (X' JX (X' 

Equation (17.6) coincides with (17.5) only if 

^{x^|Jl' *4“ (17.7) 

that is, all four quantities % anticommute with one another 

a,xV + V^,x = 0 (17.8) 

and the square of each of them is equal to unity 

(17.9) 

We recall that the 2x2 Pauli matrices also possess analogous 
properties 



(17.10) 


h ^These operators commute with each other if there is no electromagnetic field There- 
fore, Dirac proposed that one should first extract the square root of the operator for a free 
particle, and then generalize the resulting equation to the case when fields are present 
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since they anticommute [see (16.40] and the square of each is equal 
to unity [see (16.39)]. To extract the square root of the four-term 
polynomial, however, it is necessary to have four relations (17.7) 
{H = 0, 1, 2, 3), instead of three [Eqs. (16.39) and (16.40)] that are 
satisfied by the Pauli matrices. 

Accordingly, Dirac proposed that we take a system of 4 x 4 
matrices and 9n that are related to the 2x2 matrices by the 
expressions 



where are the Pauli matrices 


0' = 


0 0 
0 0 


and V= 


1 0 
0 1 


Hence we find, for example, 


(17.11) 

(17.12) 




/O 1 0 Ox 

/ 1 0 0 o\ 
0 0 0 1 
\0 0 1 0 / 


and so on. 


The properties of these matrices are similar to those of the 
Pauli matrices, as may be easily checked by direct multiplication. 
In particular, it turns out that their squares are equal to unity 

c- = p‘ = J (17.13) 

or, more exactly, are equal to the 4x4 unit matrix 


/I 0 0 O' 
(0100 
0 0 10 
\0 0 0 1 > 


(17.14) 


As in the case of Pauli matrices, we have 

oioj = — 02O1 ^ „ p,p^ == — p^p, == ip., and so on. 

^nPn' = pn'<^n (n. n'— 1, 2, 3) V • / 

From this it follows that the different matrices a anticommute with 
one another (a similar conclusion is also true for the system of p 
matrices): 

= 9n?n’ + ?n'9n == (17.16) 

The matrices and P^', however, commute with each other. 
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Dirac proposed that the matrices [see Eq. (17,3)] be 
chosen as follows: 


= = ^,) («=1. 2, 3), 

__/r 0'\ 

*— \o' — rj- 


(17.17) 

a„=rp, = (-, (17.18) 

In conventional notation these 4x4 matrices have the form 



/O 

0 0 


/o 

0 

0 - 



(0 

0 1 


u — 1 0 

0 

L 

o\ 

— 

io 

1 0 


’ “'“lo - 

-1 

0 

0 ’ 


Vi 

0 0 

oy 

'1 

0 

0 

0 / 

^0 

0 

1 

On 


/I 

0 

0 < 

0 

0 

0 - 

-1 

\ 

/ u 

1 

0 

1 

0 

0 

0 

1 0-0 P.l ^ 

(0 

0 - 

-1 

.0 - 

-1 

0 

oy 

/ 

\0 

0 

0 — 


V 


(17.19) 


v/ 


Multiplying the above matrices by one another, it is easy to show 
that they satisfy the relations (17.7). 


B. THE DIRAC EQUATION. CHARGE DENSITY 
AND CURRENT DENSITY 

Let us substitute the corresponding operators into the linearized 
relativistic relation (17.3) between the energy and momentum. We 
obtain the Dirac equation for a free particle** 




(17.20) 


where the operators E andp are, as usual, equal to 

P = — ‘^v. 


^Because of the four components of the wave function each state can have either 
positive or negative energies (see below) and two directions of the spin (see Chapter 18) 
In the classical case the relation (17 1) between the energy and momentum can be 
represented m a form similar to (17 20 ) 

E -V • p — \l^~ - 0 

This equation is easily verified if it is remembered that for a free particle 
f,. and p--^ 

Consequently the matrix « * u e must play the role of the velocity, while P 3 is 

a scalar that characterizes the Lorentz contraction 
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and the Hamiltonian H is given by 

H = r(a • p) + (17,21) 

Since a and are 4x4 matrices,* the wave function ^ must have 
four components, which we combine to form a single- column matrix 





(^. 22 ) 


The complex conjugate of this function is understood to be the 
Hermitian adjoint, that is, the row matrix 

(17.23) 


The Dirac wave equation (17.20) is therefore equivalent to a 
system of four equations 

(E — (j;, — C (P;, ~ jpj,) 'K — 0, 

(E — OToC') — c (p, + «p„) i}i 3 + cp.ipi = 0, 

(E OT„C'‘) ta — c (p^: — ip,,) ~ = 0, 

(E -t — c (p^ -1- JPv) + cp A = 0- 

In the case of motion of an electron in the electromagnetic field 
specified by the given vector and scalar potentials A and a>, we 
can still use Eqs. (17.20) and (17.24), but the energy and momentum 
operators have to be generalized in accordance with the general 
laws of quantum mechanics: 

F = — e®, P = — im — jA^ (17,25) 

The complex conjugate of the wave equation may also be repre- 
sented in the form of a single matrix equation 

<1<^(F — c(o- P) — p3m„c’) = 0, (17.26) 


(17.24) 

) 


where the action of the operators iH and —iHV on the wave 

function which is on their left should be taken to be the same as 
in Eq. (17.20) but with opposite sign, that is, 

— = = (17.27) 

For a free particle, Eqs. (17.20) and (17.26) now become 

'l' + (o • V) i]) — P3m„c*'ii =0, (17.28) 

in ^ <1.+ + . o) + m„cVp., = 0. (17.29) 
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Multiplying (17.28) on the left by and (17.29) on the right by 
and adding them, we obtain 

~c dt (17.30) 


which may be interpreted as the equation of continuity for the 
probability density p and the current density j\ 

-|^-p+V-y=0, (17.31) 


where‘s 




(17.32) 


If we write the last equation in terms of components of the wave 
functions, rather than in terms of matrices, we obtain 


+%*'!'»+ (17.33) 

that is, po IS a matrix consisting of a single element (it is just a 
number). In exactly the same way it is readily shown that 




000 
0 0 1 

010 0 /\'^ 
\\ 0 0 oy vpi 





\ 


92 / 


(17.33a) 


We note that, contrary to the Klein-Gordon theory, the density 
p„ is a positively defined quantity. This does not mean, however, 
that in the Dirac theory p„ can be considered the particle 
density. Just as in the Klein-Gordon theory, there will be particles 
with a sign opposite to that of the electrons (positrons). From 
Eq. (17.32) it can again be concluded that ca should be regarded 
as the velocity operator. 


'’Similar lelations will also hold in the rase when a field is present 

*^In second quantisation, the definition of pQ as a positive quantity means only that 
Fermi statistics should be applied to the particles (for example, in the case of the Dirac 
equation), if Pq may take cither positive or negative values (for example, in the case of 
the Klein-Gordon equation), then Bose statistics should be applied to the particles 
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C. TRANSFORMATION PROPERTIES OF THE WAVE 
FUNCTION UNDER LORENTZ TRANSFORMATIONS 
AND SPATIAL ROTATIONS 


According to the general principles of the special theory of 
relativity, physical laws must be independent of the choice of the 
Lorentz frame of reference. Therefore, the Maxwell equations, 
the Klein-Gordon equation, and the Dirac equation must all be 
invariant under the Lorentz transformations. Let us investigate 
the transformation properties of the Dirac wave function. The 
Lorentz transformations can be written as 


— xsinhi', cosh T — r/ sinh -f, y, (17.34) 

where 


cosh ^ 


7= , sinhY — -- ^ ™ 


This transformation must be satisfied by all four-dimensional 
vectors, including, in particular, the charge and current densities 

cp' = cpcosh 7 — /vSinh^, = cosh 7 — cpsinh7, ^ (17.35) 

The definition of these quantities, according to the Dirac theory, 
gives 

= f]^''(cosh 7 — a, sinh f) 'J; = 

(aj cosh 7 — sinh 7) f (17.36) 


Here we have used the fact that = cosh 7^1 - sinh 7a, = cosh 
7 — a, sinh 7, since af' = 1, aj, where // is an integer. In 

order to satisfy the above relations, we must set 


(J;'™^cosh — o»,sinh = c 

cosh 1 — 7, sinh ^ 


Then, since 


oL^e 


e 


J 

2 ' 







(17.37) 


(17.38) 


it is easy to show that the relations (17.36) are correct. From 
(17.37) it can be seen that the wave functions do not transform as 
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vectors (whole angle 7) or as tensors (double angles 7), but as 
spinors, the transformation of which is characterized by the angles 
7 / 2 . Spinors arft alsp called tensors of rank 1/2, 

In a similar manner'^ "Kay"T)e' I'he transformation 

law of a spinor under an ordinary spatial rotation (for example, 
around the 2 axis by the angle <p) is as follows: 










-in - 


(17.39) 


The above relations follow from the transformations for the cur- 
rent vector: 


cos cp + /y sin <f, 
h — jyCos'f — i\sln'f, 

jz = i 


(17.40) 


these transformations are represented in the Dirac theory as 


f (*1 cos <P + cti sin cp) ip, (17.41) 

(Jj''a,'p' = i{)+a 3 il( and so on. 

By substituting the values for ij*' from (17.39), and using the fact 
that 


<»•- 


a,e 




COS “- + ^^3 Sin 


9 

cos-|- 


?o,, sin 








“3. 


we can obtain the relations (17.40). 



(Ihiipter IH 


The Dirac Theory of the Motion of an Electron 
in a Central Field of Force 


A. ORBITAL, SPIN AND TOTAL ANGULAR MOMENTA. 
CONSERVATION LAWS 

We shall determine the angular momentum of an electron from 
the conservation laws characterizing the motion of an electron in 
a central field of force: 


V = e*&(/-) (18.1) 

(for example, an electron moving in the Coulomb field of a nucleus 

V — ^ 1 . It was shown in the nonrelativistic Schrodinger theory 

that the orbital angular momentum is consei ved in a central field 

L==rxp. (18.2) 

In the Dirac theory, however, which takes into account the electron 
spin, the component of the orbital angular momentum does not 
commute with the Hamiltonian 

H=c(a.p)-f p,m„c*4K(r) (18.3) 

and therefore it is not a constant of the motion. Indeed 

HL,- L,H = f- (a,p^ - a,,p,) 0. (18.4) 

In order to generalize the law of conservation of angular mo- 
mentum to particles having spin, we shall use the relation 

Ha,-c,H = j(a,p,-«.p^). (18.5) 

It follows from this that the operator 

J = L + -^«» = L-f-S 


( 18 . 6 ) 
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commutes with the Hamiltonian operator H and thus is a constant 
of the motion. ^ 

This result may be interpreted in the following manner. The 
electron has an intrinsic angular momentum (spin). We have just 
found that only the total angular momentum is conserved (the 
orbital angular momentum plus the spin). The orbital angular 
momentum in the s state is equal to zero, and therefore we have 
here the law of conservation of spin angular momentum. For the 
square of the spin we obtain 

S' = L I „5) = s (s 1 ) ft' = A ft\- (18.7) 

that is, the electron spin takes half- integral values 5 = 1/2 (in 
units of h) 


B. PROPERTIES OF THE TOTAL ANGULAR MOMENTUM 
OPERATORS. QUANTIZATION OF THE TOTAL ANGULAR 
MOMENTUM. VECTOR MODELS 

We shall now show that the operators for the components of the 
total angular momentum in the Dirac theory satisfy the same com- 
mutation relations as the operators for the components of the 
orbital angular momentum in the nonrelativistic quantum theory 
(see Chapter 11). It can be seen that the operators L and S com- 
mute with each other, because they act on different variables. 
Therefore, 

J J,J , (L, SJ (L, + S,) - - (L, + S,) (L, + SJ = 


and 


tJ y J JC I ^ J *f 

( 18 . 9 ) 
J 

The last two relations are obtained from the first by cyclic per- 
mutation of the coordinates 

y->Zy z->x. 


^ Since only the total quantity (18 6) is conserved, the separation of the angular mo- 
mentum into spin and orbital parts is not rigorous in the general case This separation is 
found to be possible only in certain special cases (see Chapter 20) 
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The operator of the square of the total angular momentum is seen 
to be 


= (L + S)^ = -f 2 (L.S, + L„S, + LA). (18.10) 

which commutes not only with the Hamiltonian but also with any of 
its components, for example, with the z component: 

(18.11) 

By analogy to the orbital and spin angular moments, we conclude 
that the square of the total angular momentum and one of its com- 
ponents (for example, J^) may have simultaneous eigenvalues. 

The quantization rules of the total angular momentum may be 
found from the quantization rules of its orbital component (for a 
spinless particle): 

= ), L.^tun, -\-l) (18.12) 

and its spin component (for example, for 1-- 0) 

S,= hm,, V,) (18.13) 

This problem can be solved exactly in general form in the Dirac 
theory. For the sake of simplicity we shall, however, solve it in 
the Pauli approximation, that is, taking into account the spin of 
the particle and assuming that the particle itself is nonrelativistic. 

If the particle moves in a central field of force, the components 
of the wave function 


( 18 . 14 ) 

which obeys the Pauli equation (see Chapter 16), can be related by 
means of the law of conservation of angular momentum 


J* 






(18.15) 


where L== r x p is the orbital angular momentum operator and u' 
are the two-component Pauli matrices. We shall look for a solution 
of the system of equations (18.15) in the form^ 




(18.16) 


^In this particular choice of a solution, only the square of the orbital angular momen- 
tum IS conserved, not its projection on the z axis. 
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where Kj” are the spherical harmonics (see Chapter 11). Then, 
since 

(18.17) 

Eqs. (18.15), (18.12) and (18.13) give 

' (,:l)(J;) =[/(/+ 1) -/(/+!) 

or 

l[(L,-£L3,)yr,+ L;F,]=</’F.. 

J-[(L, + /4)‘F,-L,qr,l=9'Fj. 

where 

</=-/(/-(- i)-^((+i)-|-. 

Using the relations (11.87) and (11.88), we have 

UYT = ~i1t YT = mllYT, 

(L, dr lY,) YT = -h {T^ Yf-'. (I 8 . 2 O) 

Then, taking Into account (18.16), we may write Eq. (18.18) as 

(c/ m+ l)^i + K(^+ 1 — m)(/ -^m)C.2 = 0, 

]/(/-!- 1 ^-m)(f-Vm)C, y(q-\^m)C, = 0. 

From the requirement that the determinant of the system be equal 
to zero, we find two values of q corresponding to the two possible 
types of solution 

q^l, / = C,=.- (18.22) 

q = -{l+\), / = /-'/». C^. (18.23) 

The coefficients C, and which determine the relationship^ be- 
tween the spherical harmonics in the sum of the orbital and spin 
angular momenta, are called the Clebsch-Gordan coefficients. 


(18.18) 


(18.18a) 


(18.19) 


3 

We note that this relationship between the spherical harmonics is obtained only in the 
case of spin-orbit interaction, which we have taken into account with the aid of the re- 
lations (18 IS) If there is no interaction, the two solutions will be completely independent 
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Using also the normalization condition C; + C“= 1, we may write 

the first type of solution when /=/ + -i, /=:0, 1, . in the fol- 

lowing form: 




-xf y"*-* 

^ M ' _v^(y-^+V2) 

— i I m 

1 / I \ — m w/n 

-y 


(18.24) 


For j = l /= 1, 2, . . . (the second type of solution), the wave 


function has the form 





— m + 1 
2 / 4-1 


yrn^l 


V 


I ^ ffi 
2/+1 


ym 



(18.25) 


where K/'L are the so-called spherical spinors. The orthonormality 
condition for the spherical spinors can be written as 


I d<2 Y\^„ = V , (18.26) 

where j = l V 2 corresponds to the case in which the spin and orbital 

angular momenta are parallel, and j = l — case when they 

are antiparallel. This condition follows Immediately from the fact 
that the spherical spinor is a single-row matrix and from the 
orthonormality condition for spherical harmonics. The spherical 
spinors (18.24) and (18.25) are spinor generalizations of the ordinary 
spherical harmonics (see Chapter 11) and represent the angular part 
of the solution for all problems involving the motion of a particle of 
half- integral spin in a central field of force. 

Substituting these solutions for the function 'F into (18.15), we 
find that the component of the total angular momentum takes the 

value = where the quantum number nij is equal to nij = tn — ~^ 

For the first type of solution (/ = /+ — ), it can be seen from (18.24) 

that it ranges from — / (mj = — I = — /) to 1(^/==^ -h^==/)* 
since the coefficient at the function Fr ~ which does exist jfor the 
last value of m, vanishes. In exactly the same way the number m 

in the second type of solution (/ = / — \) ranges from ^ 
(mj = ~j) to / = 

Thus our results can be summarized as follows. The square 
of the total angular momentum has the eigenvalues 


l± 




2 * 


1^0, 
i = 0, 


(18.26a) 
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that is, it is quantized similarly to the orbital angular momentum, 
except that the quantum number /, which is called the total 
angular momentum quantum number, takes half- integral values. ^ 
The eigenvalues of the component of the angular momentum along 
any axis are also characterized by half- integral quantum numbers 

J^ = 1lnij, = +/' (18.27) 

From the relations (18.6) and (18.7) and the quantization rules 
(18.26) and (18.27), it is easy to obtain quantization rules for the 
scalar products L • S and J • S, which are important in spectros- 
copy 

L. S =j(P-L’-S^) = y{/(/-f l)-/(/+l)-s(s+l)K (18.28) 
and by analogy 

J- S + {/(/ -I- !)-/(/+ l) + s(s fl)}. (18.29) 

We shall consider here the vector model of the addition of angular moments. In 
spite of the lack of mathematical rigor of this model, it enables us to resolve a number 
of complicated questions and often gives accurate results. 
As we know, the orbital angular momentum does not have 
a specific direction in space in quantum mechanics. The 
absolute value (square) of the angular momentum and one 
of its components, for example, along the z axis, have, 
however, simultaneous definite values. These facts can be 
represented geometrically by an angular momentum vector 
that describes a cone about the z axis. The projection of 
the angular momentum on the z axis will then have a well- 
defined value, whereas the projections on the x and y axes 
remain indeterminate. These arguments apply with equal 
validity to the spin angular momentum since it has the 
same commutation properties as the orbital angular mo- 
mentum. The spin and orbital angular momentum vectors 
are oriented in space in such a way that their sum forms a 
vector J that is constant in magnitude. Thus, the vectors 
L and 5 do not have arbitrary directions; they process about 
J like two coupled gyroscopes. 

The dimensionless quantities y*, /*, and s* are drawn 
In Fig. 18.1, Each of the vectors I* and s* is defined on 
the surface of a cone. They '*precess'* around J* like a 
coupled system. We note that, according to Eq, (18,26), 
the addition of the vector /* (/= 0, 1, 2, , . and the 
Fig. 18.1. Addition of the vector s* ($=^ 73 ) leads to the total angular-momentum 
spin and orbital angular vector j* with half-integral values of the total angular mo- 
moments. mentum quantum number j = I ±. Va. 

The number / is also called the internal quantum number. This number was introduced 
by spectroscopists before the discovery of spin on a purely empirical basis. It expressed 
certain internal properties ot particles that were still unclear at that stage. 

''The lack of mathematical rigor lies, for example, in the fact that the square of the 
vector j IS equal to }(j t 1 ) rather than to 

^From the standpoint of the classical theory, this coupling can be interpreted as a 
coupling of the orbital motion by the magnetic field. 





C, MOTION IN A CENTRAL FIELD OF FORCE INCLUDING 
SPIN EFFECTS. THEORY OF THE ROTATOR 

If we wish to investigate the motion of a particle in a central 
field in the nonrelativistic approximation with the inclusion of 
spin effects, we must use spherical spinors Y\^\n characterizing 
the states in which the total angular momentum (orbital plus spin) 
is conserved, instead of the spherical harmonics describing 

the states where only the orbital angular momentum is conserved. 
Since the spherical spinors (inthe nonrelativistic approximation) are 
composed of spherical harmonics having the same quantum number /, 
we obtain the same radial equation as fora nonrelativistic spinless 
particle, that is, 

VrR + +--^) R - 0. (18.32) 

For the wave functions of an electron in a central field we obtain 

xv\i^RY\l, (18.33) 

where the spherical spinor Y\m is defined by the expressions 
(18.24) and (18.25). 

In particular, for the rotator we may set r = a= const and the 
radial part of the wave function R^l, It is seen that the spin 
effects in the nonrelativistic approximation do not give any ad- 
ditional terms for the rotator energy, which will be given by the 
same expression (12.7) as for a spinless particle, that is. 

The wave function will be given by the spherical spinor Y)^m\ 
therefore we must find the selection rules for the quantum num- 
bers I , nij and j . These selection rules hold not only for the 
rotator, but also for any problem of motion of a particle in a 

central field of force (for example, the electron in the hydrogen 

atom). 
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In place of Eq. (12.19), from which the selection rules for 
spinless particles were established, we now have 

} (V W'Y qY', '^ d2, (18.35) 

where q may have three values 

< 5 ^ = 2 ! — cos 0 , q = x (18.36) 

(for simplicity let us take the radius of the rotator equal to unity: 
a— I). If in place of the spherical spinors we substitute their 
values from (18.24) and (18.25), the matrix element (18.35) be- 
comes 


{q)'!m/ 




f (YT'~y qVr ~ ' d'J h C'V) <f {Yfy qYT dH (18.37) 


From this it is seen that the two integrals in (18.37) agree exactly 
with the integrals in Eqs. (12.19)-(12.22). The selection rules for 
the quantum numbers I and m will therefore be the same as for 
a spinless rotator, that is, 

Mr=l --l'=±\, Am = 0 ((/ = 2 :), Am=rtl (ij = x-±ty). 

We shall now find the selection rules for the quantum numbers 

tUj and / Since nij is related to m by the relation nij^ni — 

for both types of solutions, the selection rules of nij will be the 
same as for //i, that is, 

Am^=-0, ±1. 

To determine the selection rules for / let us consider first the 
case in which the transitions occur between states characterized 
by the same type of solutions (/' = /'-)- */ 2 ~> y = V 2 f — ^^1 1 ~> 

— It follows from (18.24) and (18.25) that the coefficients 
and are always positive and therefore such transitions 
are always allowed. In this case the possible change of j must 
be the same as the change of the orbital quantum number /, that is, 
Aj = M = ±: 1 

If the transitions occur between states characterized by different 
types of solution (f = 1' = l — or / = /' — = 
then by taking into account A/==itil, we obtain three possible 
values for A/ = 0, +2, -2. Here, however, we must consider the 
fact that the coefficients and have different signs. For 
A/ = ±:2 the two terms cancel each other, so that this transition 
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is forbidden. For A/ = 0 the difference between the two terms is 
not zero, but owing to the fact that the two terms occur with 
different signs, the intensity of the radiation will be weaker 
than in the case of transitions between states characterized by 
the same type of solutions, when A/ = -±: 1. This can be shown with 
the help of a specific example. Let us suppose that the initial 
state is / = /— 7^ and the final state is / = 79 . We shall 

consider the case Am = m' -m = 0. Then, using (12.22), we re- 
duce the appropriate matrix element (18.37) to the form 

= 8,,, ,+ , ID'/'. {/, m - 1) + C'/. nA {/, m)] + 

-f 8,.,,_,[D'/’./'D(/. + m)], 

where Vi and — Substituting the expressions 


DO' I- >/3. y = / ~ -7=—' --■== — tti 


and 




I 

/(2/-inH2/-fT) 


1 ni Y I ■+m 


from (18.24) and (18.25), and the expressions for A and B from 
(12.22a), we find that the coefficient of vanishes; that is, 

the transition A/ = — 2 is forbidden. At the same time, the co- 
efficient of does not vanish, that is, the transition A/ = 0 

is allowed, but the intensity of the lines is weak in comparison 
with A/ = nh 1. In a similar manner it is easy to show that the 
transition A/==2 is forbidden not only for q=r-z, but also for 
q = x±iy (Am=±-I). 

In accordance with the above discussion, the selection rules 
for the quantum numbers which characterize the state of a particle 
in a central field of force, when the spin is taken into account, 
have the form 


A/=:±:l. Amy = 0, drl, 



(normal intensity), 
(weaker intensity). 


(18.39) 


D. PARITY OF A STATE 

In connection with the formulation of the law of conservation of 
angular momentum for Dirac particles, we shall now define more 
clearly the meaning of the quantum numbers / and j in the Dirac 
theory of a particle in a central field. In nonrelativistic quantum 
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theory, the orbital quantum number / is associated with the square 
of the angular momentum L“==/!iV{/ + 1)» which is a constant of 
the motion; therefore in both Schrodinger’s and Pauli’s theory 
the quantum number I represents a quantity that is constant in 
time. 

In the Dirac theory the orbital angular momentum does not 
commute with the Hamiltonian, and therefore it is not a constant 
of the motion. Consequently the quantum number I has only an 
approximate meaning when used in connection with the law of 
conservation of angular momentum. 

It turns out, however, that I characterizes an additional property 
of a particle in quantum theory, namely, the parity of a state. By 
the parity of a state we mean the behavior of the wave function 
with respect to space inversion: 

— — z=z — z\ (18.40) 

The parity operator is defined as follows ’ 

that is, it reverses the signs of the space coordinates. The eigen- 
values X of this operator may be found by applying this operator 
twice: 


= (18.42) 

This double application of the parity operator leaves the co- 
ordinates unaltered. From (18.41) and (18.42) it follows that 

X = =t:l; (18.43) 

that is, either the wave functions remain unchanged with respect 
to space inversion (even functions, X== 1), or they reverse their 
sign (odd functions, X =— 1). 

We shall now find the quantities which determine the parity of 
a wave function in a central field. In the spherical coordinate 
system r, D, <p, space inversion affects only the angular part 

/ = r, = — = + (18.44) 

as can easily be seen from the fact that the sign of the coordinates 

A'=rrsin ^cos cp, y=rsin^sin cp, e = rcos^ (18.45) 


7 

This operator converts a right-handed system of coordinates into a left-handed system, 
and vice versa In the Dirac theory I0(r) - P 3 0(— r). 
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changes. The radial part of the wave function remains unchanged 
with respect to space inversion, whereas the angular part changes 
in accordance with the relationship 

= ir + 9)==constPr(---cosO)e**«*(v + ^)=== 

= (18.46) 

because 

Pf* (_ ;f) = (_ 1 )/ mp,"* 


Thus the parity of a state in a spherically symmetric field is deter- 
mined by the parity of the number I 

Furthermore, it can be seen that the Hamiltonian in a central 
field remains unchanged under space inversion; therefore, the in- 
version operator I and the Hamiltonian operator H commute with 
each other. It follows that the parity of a state is a constant of the 
motion, since 


1^1= i(ni-IH) = 0. (18.47) 

The law of conservation of parity has no classical analog, unlike 
the other conservation laws (energy, momentum, angular mo- 
mentum). Consequently, in nonrelativistic wave mechanics the 
number / characterizes two constants of the motion: the square 
of the angular momentum and the parity of a state. In the Dirac 
theory, the number / does not have the significance of the square 
of the angular momentum, but the relationship between this num- 
ber and the parity of a state is preserved. 

We shall see later that parity plays a particularly important 
role in the physics of elementary particles. All wave functions— 
whether for one or more particles— can be classified as functions 
of odd or even parity. Dipole transitions can occur between states 
with different parities. In the case of two or more particles, the 
parity depends on the total spin of the system and also on the type 
of statistics obeyed by the particles. These concepts will be 
analyzed in some more detail in the treatment of specific examples. 


E. SOLUTION OF THE DIRAC EQUATION 
FOR A FREE PARTICLE 

Let us consider the motion of a free particle of spin 1/2 and con- 
stant momentum such as, for example, an electron. Without loss of 
generality we may take the z axis to lie along the direction of 
momentum; that is, in Eq. (17.20) we set 


P* = P>=0. P»7>i=0. 


(18.48) 
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The Dirac equation then takes the form 

di + Oz ~ 'i' (18.49) 

We shall look for a solution of this equation in the form 

^ (18.50) 

where L* is the normalization volume, and the wave number k has 
the same value as in the nonrelativistic Schrbdinger theory, 
namely, 


= 1, 2, .. ), 


and the 4x4 matrix 



satisfies the normalization condition 

b^b = b^bt + bib, + bi b, -}~blbi==\ (18.51) 

To determine the quantity e and the coefficients we use the 
system of equations (17.24), setting 

We then obtain 


(e/C - ko) hi - kb, 0, (e/C — k,) b, + kbi^ = 0. 
(s/C lk,;)b,~kbi^Q, = 


(18.52) 


Hence we find two values for the quantity e: e= l (the energy of 
the electron is positive) and s = — 1 (the energy of the electron 
is negative); while we obtain four values for the matrix 6, the 
components of which satisfy the normalization condition (18.51). 
Two of the values of the matrix b refer to states with e = 1: 
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and two of them refer to states with s = — 1 : 




1 

|/T 




(18.54) 


that is, the states (18.53) differ from the states (18.54) in the 
sign of the energy. 

To determine the physical significance of the states with 
different i= 1, 2, 3, 4, let us find the projection of the spin on 
the direction of motion, that is, on the z axis. First of all, we 
note that, since — /yp^ = 0 when a particle is moving 

along the z axis, the projection of spin on the z axis must be con- 
served. This follows directly from the fact that the matrix a, 
commutes with the Hamiltonian in (18.49). 

We can find the eigenvalues of this operator by applying 
it to the spin functions 6 “^ We have then 


1 

2 - 0 , 



n 0 0 
1 0 — 1 0 
0 0 1 
\0 0 0 



(18.55) 


that is, for this solution the eigenvalue of the operator ‘/s equals 
Va . In exactly the same way it is easy to show that 






Thus, the four possible states correspond to the four possible com- 
binations of the sign of the energy and the spin direction. The solu- 
tion corresponds to positive energy (£= 1 ) and the projection of 
this spin along the positive z direction (s = 1), In a similar way we 
have £=:= 1 , s:=— 1 for the solution //‘^'and we have e = — 1 , .s = 1 
and £— — 1 , s — — 1 for the solutions Z;‘*^andfe'‘^, respectively, where 
s is double the projection of the spin in the direction of the mo- 
mentum. 

In the nonrelativistic limit (v < c), the wave functions (]>3 andt );4 
will be of the order of times the wave functions t[>i andtpg 

^or positive energy states (£= 1). For states with nega- 
tive energies [see (18.54)], on the contrary, ^ 3 . 
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We have considered the special case of motion of a particle along the z axis. This 
does not restrict, however, the generality of the investigation of the general motion of a 
particle. Whenever the direction of momentum is characterized by spherical angles ^ 
and 9 , it is always possible to choose a primed coordinate system In such a manner 
that the z* axis is directed along the momentum. Then, by carrying out two rotations, 
one rotation through an angle d around the y' axis directed perpendicularly to the zz' 
plane, the other, second rotation through an angle 9 around the z axis, we may transform 
from the soluaon in the primed coordinate system (momentum along the 2 ' axis) to the 
general case (direction of momentum characterized by the spherical angles 9 ). 

Using the fact that under a rotation of the coordinate system the wave function 
changes in accordance with (17.39), we may write the solution for this general case 


I - v -I- Oja) I 
-2 


('--i 


,)■/. \ 




£ Ki+S 




ikr 


(18.56) 


which is a gen eralization of Eqs. (18.50), (18.53) and (18.54). 

d 

-Pi^ nhlem 18.1. 3 ^ow that the energy and momentum operators ^ ^ p = — ifiV^ 

respectively, tranSorm like a four-vector under the Lorentz transformations; 




P^4-' 


where 




0 ® Cl ^ 

p = so on. 


Hint. Use the Lorentz coordinate transformation [Eq, (17,34)] and change to new 
variabl es in the process of differentiation. 

18,2rV *rove the relativistic invariance of the scalar equation for a free 

particle. 

Hint. First, let us prove the invariance of the operator relation 
E'* — c*p'* =£2— cV 

by using the results of the preceding problem. 


,,.^robleml8j3^ that in the case of spatial rotation of the coordinate system 
aroiuia 'nib axisby the angle » , the wave function transforms according to the relation 




= ( y + faj sin = < 




Hint. Use the method which leads to the relation (17,39). 


Pro] 


fln( 


With the aid of the vector model of addition of angular moments, 

le angles between J* and s* and between j* and I’* taking into account the geo- 


metric vector addition in quantum mechanics; that is, find 

cos (/♦s*); cos 


6 


These problems also refer to the 


material in Chapter 17 
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Hint, Use a method similar to the one that led to Eqs. (18,30) and (18,31X 


roblem 18,^ Show that the wave function where C = /aap8, and 4>*is 

the compiyx Conjugate (but not the Hermiaan adjoint, that is, not of the Dirac wave 
function for an electron with negative energy satisfies the Dirac equation with posiave 
energy and opposite (positive) sign of the charge, that is, describes the motion of a 
positron (a charge conjugate transformationX 
Soluuon, The Dirac equation is 

+ -(7S-7''-)| 

for the complex conjugate of this equation, we may write (taking into account the fact that 

a; = ai, a* — a.., a j = = p«): 

{t I - ^ h - ““ (t 1+ 7 -I- 

+ “• (t S ~ ‘’’'""'■“I 

We note that the complex conjugate differs from (the Hermitian conjugate): namely. 


<];* = 


whereas 


Let us substitute = risp®'!^* into the complex conjugate of the Dirac equation. We 

then find that ^ satisfies the Dirac equation if the charge e Is replaced by - e. 

-i\B\ , 

* * 

Since the state t) = e 'j/ (r) is treated as a state with positive energy (E == | £ | ), 

and the state (r,t) = e (r) is treated as a state with negative energy 

= — l^l), we must interpret the sign of the energy in Ij; differently than in 



r.hapler 19 


The Dirac Equation in Approximate Form 


In many problems which are solved by the Dirac theory, we 
retain only the relativistic corrections of the order of ) in the 
final results. Therefore we may immediately write the Dirac equa- 

( V 

tion in an approximate form, retaining quantities of the order of yyj . 

It will be shown below that the role of both the relativistic and spin 
terms is clearly displayed in this approximation. 

Let us consider the motion of an electron with positive energy 
in an electromagnetic field which does not depend on time. In 
this case we may replace the energy operator by its eigenvalue, 
separating out the rest mass energy moc*: 

E — moC^ + E. (19,1) 

The wave equation (17.20) then becomes 

(E - e<I>) (a’-P) (J j , (2m^^ + E - e<I>) ('J j (a'-P) ({j . (19.2) 

where a' stands for 2x2 Pauli matrices [see (16.18)], and P = 

p — ^ A, Equations (19,2) are simply a different form of the exact 

Dirac equation. For the components of we obtain from (19,2) 

(E — — c (P^^ — iPy) ~ = 0 

and so on. This equation can also be obtained from the first of Eqs, 
(17.24), if we substitute into it both (19,1) and (17.25), 

As was mentioned in the preceding section, in the nonrelativistlc 
limit the components and are ‘‘smalT* for positive energy 

states, since they are of the order of y times the ‘Targe functions 

and The transition to the approximate Dirac equation consists 
in eliminating the “small” components -jig and ''»i from Eqs. (19.2) 

and retaining terms of the order of in the remaining equations 

for the “large” components and 9^. Thus from (19,2) we obtain 



(19.3) 
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First of all we change from the four- component wave functions 



to the two-component functions I 




=«(?;;), 


by setting 


(19.4) 


where N is the normalization coefficient. This coefficient may be 
determined from the ‘‘renormalization” relationship 

(tftftJ'i't) = (mi) . (19.5) 


Since the “small” wave functions and ^4 occur as squares in 

(19.5), we may set 



(19.6) 


that is, in (19.3) we neglect second-order terms and replace P 
by p. This change of operators is permissible in calculating the 
normalization coefficient since they differ from one another by a 
first-order term (inversely proportional to the velocity of light). 
When applied to second-order terms (squares of the “small” 
wave functions), this term of P gives only third-order terms, 
which we discard. Then, substituting Eqs. (19.4) and (19.6) into the 
left-hand side of (19.15) and using the equation^ 

(or' - a) (a’ - b) = (a- b)+ <p • (ax6)J, (19.7) 

which holds both for the Pauli and Dirac matrices, we find 


N (19-8) 

From this we obtain 


yv=i-~ 


8w»c» * 


*In order to prove this equality, we may, without loss of generality, assume that vector 
a IS directed along the 2 axis ^ Oy ^ 0, - a), and that vector b is located in the zx 

plane (by - 0) Then we obtain 

(a' ' a)(fj' • b) - «) (-^^3 ^{b^) ^ + la^ah^ ^ 

a b \ t[a ‘ (b X a')\ a 


•6 t 1 [a ' • (a X 6)] 
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This approximation gives us, in accordance with (19.3) and 
(19.4). 




Smtc‘ )\r,)’ 



(19.9) 


Substituting (19.9) into the first of Eqs. (19.2) and neglecting terms 
of the order of ( i/ / c )^ , we find 




e(I> 


1 






- l-iL. ("'•I’) -(o'-P) (o -P) - ns'V } © 


Using the relation (19.7) , we have 


(o'. P)(o'.P) = P* + t[o'.(Px P))= 

— P* — (»'[(P Xi4)-|-(i4 X p) 1)=P* — y |o'.(v Xi4)]= (19.11) 

where //is the magnetic field since the operator v acts only on the 
vector potential A, and not on 

In exactly the same way, with the aid of the relations 

(o', p) (£ — e<I)) (o', p) = (£ — eft) p’ — (o'. E) (o', p) = ^^9 jg) 

= (£ — e<t) p' — ihe{E'^)-\-eh\a’~{E x p) ] 

and 

===p^(£-r.p) = (£ — c<l))p*-l-^f- £.p (19.13) 


where £~ — is the electric field intensity, we may reduce the 
Dirac equation (19.10) to the following approximate form: 


- 5 .)(::)=[ 


2mi 

e% 


'M_r_ p‘- 
«''(£XP)1+ s,ny 


Smlc^ 

ti^e 






(19.14) 


The left-hand side of Eq. (19.14) describes the motion of a particle 
with a nonrelativistic velocity in a stationary electromagnetic 
field. The right-hand side of (19.14) contains an additional inter- 
action energy that describes the relativistic and spin corrections. 
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The first term on the right-hand side of (19.14) 


takes into account the correction due to the relativistic velocity of 
the particle. A similar additional energy must also appear in the 
relativistic Klein-Gk)rdon equation. The classical analog of this 
term will be obtained if the relativistic expression for the Hamil- 
tonian is expanded in a series, retaining terms of the order of 

{t»/c 


H = V //V* + + -g- - 

The second term on the right-hand side of (19.14) may be written 
as 


(19.16) 

From this it is clear that the quantity p = & may be treated 

as the Dirac magnetic moment of an electron, which appears ex- 
plicitly in the nonrelativistic approximation only through this 
transition. This interaction energy turns out to be of the order of 
t'/f. From the intrinsic angular momentum [Eq. (18.6)] of the 
electron 


S = |o', (19.17) 

we find the relationship between S and p that is required by ex- 
periment and follows automatically from the Dirac theory 





s. 


(19.18) 


The next term of the expansion characterizes the so-called spin- 
orbit interaction 


which describes the interaction of a moving magnetic dipole with 
an electric field. 


This Intsrsctlon msy also b. Interpreted from the clesslcel point of view In the follow- 
ing way; a magnede dipole moving with a velocity v (the spaUal component of a tensor 
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quantity) acquires an additional electric moment (space-time component of the same 
tensor quantity) 

I*el = 7 ® ^ ^ ^ (19.20) 

which interacts with the electric field of the nucleus. This additional energy of inter- 
acaon is 


=- /-.liej 


e/l 


lo:(Exp)i 


( 19 . 21 ) 


This classical expression for the interaction energy is twice as large as the correspond- 
ing quantum expression [see (19,19)]. Even before the advent of the Dirac theory, an 
attempt was made to explain the fine structure by the semiclassical introduction of spin- 
orbit interaction, lo obtain an agreement with experiment, lliomas and Frenkel sug- 
gested that we substitute the coefficient 1/2 into the classical expression for the inter- 
action energy (19.21), This interacuon, which follows automaucally from the Dirac 
theory, is called the Ihomas-Frenkel correction. 

In particular, for the Coulomb field of a nucleus 

e = ~-e„. (19,22) 

The interaction between the moving magnetic dipole and the 
nucleus according to (19.19) becomes 

S-L, (19.23) 

where S=h0V2 is the spin, and L=^rxp is the orbital angular 
momentum. 

We note that there is no spin-orbit interaction for an atom in 
the s state since the orbital angular momentum in this state 
vanishes. Finally, the last term of the interaction, which in the 
case of the Coulomb field is equal to^ 


(19.24) 

is called the contact interaction. The additional energy corre- 
sponding to it 


^£cont_ \ d'x 


(19.25) 


^J. I. Frenkel, Wave Mechanics, New York Oxford University Press, 1938, Vol. 2, 
see also (16.11), where the coefficient 1/2 appears in the magnetic moment produced by a 
moving charge. 

^In the derivation of (19 24), we have used the fact that, according to (4 78), 

= -4778 (r) - -4778 (x) 8 (V) 8 (2) 


(19 24a) 
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is proportional to I '5* (0) and it will differ from zero only for the s 
state since, according to (13.28a), only in this case | ^ (0) 0 . 

For all other states {I ^ 0) this square of the wave function vanishes 
when r = 0. In this sense the contact term may be regarded as 
the spin-orbit interaction for the s state. We can see now that 
the last two terms in the interaction energy (19.14) characterize 
the spin properties of an electron. 

^Problem 19. l*j Show that the matrix ca is the velocity operator, and that In the case 
of 'a 'TFWpWTlcle a is not a constant of the motion, unlike the momentum operator 
(p = — i/iv) . Explain this difference. Determine in what case the average velocity 

® , and so on. 


will be related to the average momentum by the classical relation 

® = ( 19 . 26 ) 

Hint, From the Hamiltonian (17.21) it is possible to obtain the velocity operator 
r = Y (Hr— rH) = co, 

It can also be shown that the velocity is not a constant of the motion, since aqLO , Con- 
sequently, if for a given k we take a linear combination of positive and negative energy 
states [see Eqs. (18,53), (18.54)] there will exist interference terms that wiU fluctuate 
with time As a result, Ehrenfest's theorems wUl hold only on the average 

in the Dirac theory. The interference terms wiU disappear in the calculation of the 
average value of the momentum operator. 

Equation (19,26) holds only if the states with positive energies are retained (e= 1). 



Chapter 20 


The Fine Structure of the Spectra of 
Hydrogen-like Atoms 

A. STATEMENT OF THE PROBLEM. 

The problem of the motion of an electron in a hydrogen-like atom 
(Kepler’s problem) is rightfully considered as the touchstone of all 
forms of quantum theory. There are two main reasons for this. 
First, it has great physical significance, since the problem of 
motion in a Coulomb field can be solved exactly. Second, the 
results may be compared with experiment to a high degree of 
accuracy; for example, the emission spectra of atoms can be 
observed by optical and microwave spectroscopy. 

The solution of the problem of motion of an electron in a 
Coulomb field of a nucleus (hydrogen atom) on the basis of the 
Schrodinger equation, obtained in Chapter 13, gives an expression 
for the energy 


— . ( 20 . 1 ) 

which is in good agreement with experimental data. This ex- 
pression for the energy may be taken as the zero-order approxi- 
mation. A more detailed study of atomic spectra shows, however, 
that the spectral lines have a fine structure which, of course, must 
be associated with the detailed structure of the energy levels. The 
Schrodinger theory does not give an adequate description of the 
regularities frequently occurring in spectra, since it neglects at 
least two important facts: the relativistic dependence of mass 
upon velocity and the spin properties of the electron. Both these 
facts, as we already know, are accounted for by the Dirac theory, 
and therefore application of the Dirac equation to the Kepler 
problem gives results that accurately describe the multiplet 
structure of energy levels. 

As was pointed out, the Kepler problem can be solved exactly 
in the Dirac theory. The solution, however, requires many 
tedious calculations (much more complicated than in the Schrodinger 
theory, because in this case we have not one but four equations). 
Moreover, in the course of these calculations one does not always 
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perceive the physical meaning of the results, the analysis of which 
is of primary importance to us. We shall therefore use a more 
elementary method, based on the approximate equations of the 
preceding section. This method not only enables us to obtain 
formulas characterizing the fine structure up to terms of the 

order of (-^1 » but also to interpret the individual terms as 

manifestations of relativistic or spin properties of the 
electron. 


B. RELATIVISTIC AND SPIN EFFECTS 


As follows from Chapter 18 [see (18.24) and (18.25)], the wave 
function of a particle obeying the Dirac equation, taking into 
account the spin properties, is 


( 20 . 2 ) 

Here is the radial part of the wave function and is a spher- 
ical spinor: for y=:/ -|-7ithe spin is parallel to the orbital angular 
momentum and for / = ^-~V ‘2 it is antiparallel. 

Although terms of the order of (vjcy are not formally accounted 
for in Eq. (20.2), the relationship between the spherical harmonies 
in the spherical spinor that determines the zero-order approxima- 
tion of the wave function is established by the spin-orbit interaction, 
which is of the order of {v/cY^ 

The spherical spinor can therefore be used only when the atom 
is not subject to external perturbing forces of magnitude greater 
than those involved in the spin-orbit interaction. If that is not the 
case, the spin-orbit coupling will be disrupted and a new set of 
premises must be set forth in order to establish a relationship 
between the spherical functions. 

Spherical spinors, just as spherical harmonics, satisfy the 
equation 

(20.3) 

therefore, taking into account (11.17), the radial function in (20.2) 
satisfies the same equation that was derived in the nonrelativistic 
Schrbdinger theory: 








2mj, IfS 

r ' 




R, 


nl " 


^ There are several cases in quantum mechanics in which a small interaction energy 
enables us to find a relation between the coefficients of the functions in the zero-order 
approximation We have already encountered a similar situation in the treatment of the 
Stark effect (see Chapter 14). 
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The wave function (20.2) completely determines the selection 
rules for all quantum numbers. The selection rules for the quantum 
numbers /, /, and nij are given by formula (18.39), while the selec- 
tion rules for the principal quantum number n will evidently be 
the same as in Schrbdinger’s theory [see (13.48c)], since the radial 
function remains unchanged. Considering all this, we obtain the 
following selection rules for a theory of the hydrogen-like atom 
which takes into account spin effects: 

= A/ = 0, itl, Am^=0, ±1 (Am is an integer). (20.4) 

As for the expression for the energy, we cannot restrict ourselves 
in this problem to its nonrelativistic value (20.1), since the 
latter does not determine the fine structure of the energy levels. 

Knowing the zero-order approximation of the wave function 
(20.2), and also the additional perturbation energy describing the 
relativistic [see (19.15)] and spin [see (19.23) and (19.24)] effects, 
we may find the energy levels characterizing the multiplet struc- 
ture of the spectrum. 

According to formula (19.15), the relativistic correction to the 
energy levels is 


A£rel = — J (>rw) '• 

Since in the present case 

we see that this additional energy will be independent of the solid 
angles cp; that is, integrating over the solid angle we get 

= (20.7) 

Then the additional energy characterizing the relativistic effects 
is 


(20.5) 


( 20 . 6 ) 


= - 2 -^ mr + 2E^nZet (/-*) + Z^et (>'*)] = 

J^nz^a^ f n 3\ 

W+V« 4/* (20.8) 


where a = 


tie 


Vi37 is the fine structure constant. 
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In the derivation of (20.8) we have used (13.29a) 


— _ Z l _ 2RnZ 
^ \ eln^ * 

5 _ /Z \2 1 JlRZhn,_ 

~Uo/ + /i/iU7 + Va)- 


Equation (20.8) agrees exactly with the formula for the rela- 
tivistic energy, which was calculated in identical approximation by 
means of the relativistic Klein-Gordon equation [see (15.31)]. 

In a similar manner, with the aid of (19.23), we find the addi- 
tional energy due to the spin-orbit interaction 

^ (S • L) (A-“). (20.9) 


Using expression (13.29a) for (/— ') 

Trv (?TVJ(/ +T) ’ 

and expression (18.28) for(S • L) 


S • L 




for /^o, 
0 for l — O, 


we obtain the following value ^ for the energy (20.9): 


V /,-s.o. — Oft - 

az. 2n’ /(/+•'.)(/+ O' 


In these equations 


( 20 . 10 ) 


'7 = /(/-l-l)-/('-l-l)-s(s+l) = 


I for /==H-'/2. 
— (^+l)for / = ^ — '/)■ 


( 20 . 11 ) 


2 

At first glance it may seem that the spin-orbit interaction, which is inversely propor- 
tional to the third power of the distance, cannot give a stable state. This, however, is not 
so. At small distances the spm-orbit interaction behaves just like the relativistic inter- 
action, that IS, it IS inversely proportional to the square of the distance. Indirect proof of 
this IS the fact that AE® ° differs from only by a numerical factor of the order of 

unity 
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and the quantity 


X _ / 0 ^ ^ 0. 

1 for 1=0. 


( 20 . 12 ) 


Finally, the energy correspondingtothecontact interaction, accord- 
ing to (19.24), is given by 


2\£cont ^ 


^ 2mlc^ 


iw( 0 )r, 


where 


Y\rj; Y(fl. 

Furthermore, considering the expression for 


lRnl(0)r = ^s^O 



(20.13) 


[see (13.28a)], and using the fact that when/=:0 and 

y==^/ 2 . we find 

Pf(0)P = *^(|J. (20.14) 

that is, 

AE‘=°’^t=m~5w" (20.15) 

From this we obtain the following expression for the additional 
energy which accounts for the relativistic effects, the spin-orbit 
and contact interactions: 


AE = AE f A^cont = 

^ 4 2/(/ + V3)(7+1) 



^Incidentally, Eq. (20.10) for the contact interaction may be obtained when the expres- 
sion for the spin-orbit interaction (Eq. 20.10) is allowed to go to the limit as / -♦ 0, if we 
discard the factor 6/o m 20.10. Therefore, many authors use this procedure and neglect 
the contact interaction in deriving the fine- structure formula. However, the agreement be- 
tween the two formulas is accidental since for the a states the numerator of Eq. (20.10) is 
always zero, while the denominator vanishes only in the nonrelativistic approximation. In 
a number of other problems, for example, an atom containing several electrons, the energy 
associated with contact interaction is no longer a limit of the expression for the spin-orbit 
interaction. 
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Substituting here the value of q from (20.11), we have^ 

= - - ij. ( 20 . 16 ) 

Therefore, summing both results [(20.1) and (20.16)], we obtain 
the fine structure formula for the spectrum of a hydrogen-like atom® 


= + 


mz^\. , 
1 ' 




(20.17) 


From this it is seen that the splitting of the levels is proportional 
to the square of the fine structure constant. 


C. THE FINE STRUCTURE IN THE DIRAC THEORY 


When we take the fine structure into account the position of the 
energy levels in the hydrogen atom is found to depend also on the 
total angular momentum quantum number /. Therefore the terms 
will be denoted in the following manner: 








(20.18) 


From this formula it is seen that the fine structure, according 
to the Dirac theory, depends only on the principal quantum number 
n and the total angular momentum quantum number y. In contrast to 
the Kle in-Gordon theory, it is independent of the orbital angular 
momentum quantum number I (up to terms of the order of a^). 


"^This problem can be solved exactly in the Dirac theory. We then obtain a closed for- 
mula for the energy levels, in this formula, the first expansion term (which is independ- 
ent of a}) gives the nonrelativistic formula (20.1). The second term, which is proportional 
to , gives the additional energy (20.16). The third expansion term, which is proportional 
to in this approximation, can be neglected, since it is smaller than the so-called 
vacuum corrections, which are proportional to (see Chapter 22). 


^An exact solution of the Dirac equation gives the following generalization of Eq 
(15 30), which takes into account the relativistic effects in the case where spin is also 
present: 




1 \ 




(n-i-VaH \lo^ V2)^ A 


’ rriQC 


(20 17a) 


Equation (20 17) may be obtained from (20 17a), if the latter is expanded in a series and 
we restrict ourselves to the first two terms 

Since the minimum value of j is equal to 1/2, we find that stable motion in the Coulomb 
field of a point nucleus, according to the Dirac theory, will extend to Z^r “ 137, whereas 
in the Klein-Gordon theory it was limited by Z^r ~ ^ * 137 [see (15 33)] Such an increase of 
Z cr IS, as we have already mentioned, due to the slight compensation of the relativistic 
effects by the spin effects 
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The diagram given in Fig. 20.1 shows that all terms are doubly 
split, since to each value of / there correspond two values of /; 
for example, instead of a single term 2p (1=^1) we now have two 

terms 2pi . and 2/jt The exceptions 
are the .s terms (/ — 0), for which j 
can have only one value (y — ^ Thus 
the relativistic and spin effects some- 
what reduce but do not split the 5 terms 
(see Fig. 20.1). 

The degree of degeneracy also 
changes owing to splitting of the energy 
levels. We know that the principal 
quantum number may take the following 

values: 2 , 3 , 4 , The orbital 

angular momentum quantum number I 
varies from 1 = 0 (s’ state) to / =//—!. 
The total angxilar momentum quantum 
numter y takes the values j = l (/ 0) 

and j=\i,{l = 0) and, finally, the mag- 
netic quantum number ni, — - - y , . . . , -i - y, that is , for a given / there 
are 2 / + I half- integral values of nij. The degree of degeneracy, 
which is characteristic for any central field of force and is related 
to the equivalence of the various directions in space, is therefore 
equal to 2 / (- I for particles with a spin of 1/2 (we remember that 
for spinless particles it was equal to 2/ j- 1). In contrast to the rela- 
tivistic spinless theory, the degeneracy with respect to I is still 
present when we take into account terms of the order of and 
even the following expansion terms proportional to a^. When the 
finite size of the nucleus is taken into account, the degeneracy with 
respect to I is removed. We note incidentally that even greater 
splitting with respect to I is due to vacuum fluctuations (see 
Chapter 22). The magnitude of the splitting of spectral lines can 
be determined from the selection rules (20.4). For the Lyman 
series we then have two lines (instead of a single one): 


E/h-0 




3p3/2 , 3jJ 
1/2 

/ ^Pf/2 

Fir. 20.1. K ner^'y lt*vcl dta- 
Rram of the hydrogen atom 


u)(i) ==(lsi ,)-~(apu,), 

(weak line, since Ay — ^) (20.19) 

ojO* =z(\si J — (npi ,). 

The Balmer series lines are split as follows: 

o)<i'=(2s. 

lo'J' =: (2.vi J — (n/Jt ,), 

«)'" =(2/>.v,) — («s, ,), 

«)'» =(2pi,j) — (/I Si j, 

(O'V =(2/?, ,) — (nd. .), 


( 20 . 20 ) 
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and, finally, the transition 2pi , is forbidden, because in this 

case Ay — 2. If the degeneracy with respect to I is not removed, 
the lines and coincide, since the initial and final levels have 
the same values of the principal quantum number ti and the total 
angular momentum quantum number y. In a similar manner we 
may determine the splitting of all other lines. The lowest split 
energy level corresponds to « = 2. Let us consider in greater 
detail the splitting of this level in the case of the hydrogen atom 
(Z = l), which is the one most carefully investigated experi- 
mentally. In general, the n=^2 level would be split into three sub- 
levels, and, according to our theory, two of these sublevels would 
combine: 


(2p,.J = f [l - 


( 20 . 21 ) 


The transition frequency between these levels is, according to the 
Dirac theory, 


Au)D = (2/)./, ) — i2py, ) = Ry6’ (20,22) 

which is about 1.095 • 10‘ Me.*’ If only the relativistic effects are 

taken into account (Klein-Gordon equation) the corresponding 
splitting is [see (15.32)] 

AcoK-g =.(25) -(2/9-1^;“'”; (20.23) 

that is, the frequency is almost three times greater than the one 
found from the Dirac theory. Consequently, the spin properties 
of particles somewhat reduce the influence of the relativistic 
effects. 

The conclusions of Dirac’s theory have been accurately con- 
firmed by experiment. 

It is interesting to note that the fine structure of the spectrum 
of the hydrogen atom was first theoretically calculated by Sommer- 
feld who applied a relativistic Hamiltonian to the steady states 
of the Bohr classical theory. Sommerfeld obtained [see (2.61)] 


Me - 10^ sec L that is, the angular frequency o), expressed in sec is connected 
with the frequency expressed in Me, by the relation 
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the following ejqpression for the relativistic theory (20.22) without 
taking into account the spin effects: 

(2s) _ (2p) = (20.24) 

Agreement of the Sommerfeld result with the conclusion of Dirac^s 
theory was, however, only accidental* Sommerfeld’ s theory did 
not take into account the spin effects, and therefore it was unable 
to predict the splitting of the n = 2 level into three sublevels, the 
presence of which was later confirmed experimentally. 


D. EXPERIMENTAL VERIFICATION OF THE FINE 
STRUCTURE THEORY 

The major accomplishment of Dirac’s theory was its treatment 
of the fine structure. The theory was in good agreement with the 
experimental facts and was able to explain this structure as a 
manifestation of the relativistic and spin effects caused by the 
motion of the electrons within the atom. However, further and 
more detailed studies showed divergencies between the theory and 
fact. Thus, special attention was given to the 251/3 and 2pi/2 levels 
which, according to the Dirac theory [see (20.21)], should coincide 
in a hydrogen atom. Among spectroscopists, doubts about the 
validity of this conclusion were expressed as early as 1934, How- 
ever, the techniques of the time did not allow greater experimental 
accuracy, and the discrepancy between the theory and optical 
observations (that is, the splitting of the levels) being small, no 
great attention was paid to it. Better experimental data on this 
splitting were obtained considerably later, when microwave spectro- 
scopic techniques were used. 

The microwave spectroscopic method was invented and rapidly 
developed in the postwar years as a result of technical progress in 
microwave engineering.^ Microwave spectroscopy, which has now 
developed into a special branch of physics, gives valuable results 
when used in the investigation of nuclei, atoms and molecules. 
Microwave spectroscopic methods are also applied to the physics 
of solids and liquids. In 1947 Lamb and Rutherford employed this 
method to studies of the 2si/^ and 2pi/j levels, making use of 
a special property characteristic of the 2si/j state. This state is 


^By microwave ultrahigh-frequency radio emiaaion we mean the region of the electro- 
mametic apectrum located in the wavelength range from millimeters to tens of centimeters 
(10^ — 10^ Me). Successful application of microwave spectroscopy to the investigation of 
atomic spectra is due to the fact that the distances between the components of the levels 
split by the relativistic, spin and vacuum effects are of the same order of magnitude as the 
wavelengths in the microwave region. 



THE FINE STRUCTURE OF SPECTRA 


323 


metastable, since a dipole transition from the state to the 
lower Isi/^ state is forbidden by the selection rules A/ = 0 [see 
Eq. (20.4)].8 

Transition from the metastable state may be associated either 
with the emission of two photons (the probability of such a transi- 
tion is 10® lower than that for the allowed transition), or with 
a preliminary transition to the 2p level. Lamb and Rutherford 
investigated the latter type. 

Let us describe the general features of their experiment 
(see Fig. 20.2). A beam of hydrogen atoms in the unexcited Isi/, 
state is obtained as a result of dissociation of molecular hydrogen 
at high temperatures (tungsten furnace). A bombarding beam 
of electrons then excites some fraction of the atoms in the hot 
beam (approximately one out of 10®) to the metastable state 2si/^» 
The metastable atoms, unlike the unexcited atoms, readily give 
up their energy of excitation upon striking a metallic target. In 
so doing they remove electrons from the metal. The resulting 
current is measured by a sensitive galvanometer. 



Fig. 20.2. Diagram of the Lamb- Rutherford ex- 
periments on the detection of the splitting of 
the 2st/j and 2pi/, levels: 1) tungsten furnace 
emitting a beam of hydrogen atoms; 2) beam of 
electrons exciting the hydrogen atoms, 3) radio 
frequency field; 4)target, 5) galvanometer 


If the beam of metastable atoms is subjected in transit to a 
perturbation capable of causing a 2s->2p transition, then the atoms 
will almost instantaneously pass to the Isi/, state (prior to reaching 
the target). As a result, the current reading on the galvanometer 
is lower. 

In the Lamb-Rutherford experiment such transitions were 
induced by microwave radiation (the probability of the corre- 
sponding spontaneous transition, proportional to is vanishingly 
small as a consequence of the smallness of w); a strong damping 


®This IS correct for a dipole transition, but calculation shows that the quadrupole 
transition between these states is also forbidden 
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action, resulting in a decrease of target current, was observed at 
some frequency w. This o> was assumed to be the resonance 
frequency which causes transitions or 2si/,-^2pi/, with 

a subsequent practically instantaneous transition to the Isi/., level; 
the energy difference between these corresponds to tiw. Thus, 
one can very precisely measure the relative positions of the 
levels 


2 si/, 2 pv 3 and 2/;^//. 

These measurements showed that the level 2st/^, is shifted 
upwards relative to the level 2/h/, by approximately one tenth of 
the distance between the doublet levels 2pi/.. — 2/;i/,, which is equal 

a- 

to /?. The arrangement of the levels of a hydrogen-like atom 

(/I — 2) derived from the Lamb-Rutherford experiment is given 
in Fig« 20,3. The disposition of these levels according to the 
Dirac theory is given for comparison. According to the latest 
data, the shift of the 2si/, level is approximately 1057.77 Me 
or, in wavelengths, 28 cm. 


' 3/2 


me 


-2s, 


f/2 
’ 2 Pf/2 


'2p3/2 


■" 2S//2 , 2p,/2 


Fig 20.3. Splitting of energy levels in the hydrogen 
atom a) experimental data, h) according to the Dirac 
theory (neglecting vacuum effects) The frequencies 
of the corresponding transitions and the distances 
are given in Me. 

This apparently negligible discrepancy between theory and ex- 
periment led to remarkable progress in theoretical physics and, 
in particular, in quantum electrodynamics. This subject will be 
considered in greater detail in Chapter 22. 


^In the experiments of Lamb and Rutherford, the frequency of the microwave radiation 
was fixed and the resonance condition, corresponding to the difference in the Zeeman com- 
ponents between the states and 2pi/j or 2p3^, was obtained by adjusting the magnetic 
field Then, extrapolating the re.sults to the case = 0, the authors found the level 

shift. 
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E, ANOMALOUS ZEEMAN EFFECT 

The complete theory of the Zeeman effect (both normal and 
anomalous) must be based on the Dirac theory, because the latter 
takes into account both the relativistic and the spin corrections. 
Since the anomalous Zeeman effect is due to the spin effects in 
the atom, neither classical theory nor Schrodinger^s wave me- 
chanics was able to give a satisfactory explanation of the Zee- 
man effect, and for obvious reasons. 

As a starting point of the theory let us take the approximate 
Dirac equation (19.14), in which these effects are taken into account 

up to the terms [y) . Let the magnetic field be directed along the 

z axis, that is, -- 7/^ = 0, = Then, using the fact that, 

according to (16.16), 


2mo 


_P! t 

2///0 ^ lUoC 


i4*p 





(20.25) 


we reduce Eq. (19.14), describing the motion of an electron in the 
Coulomb field of a nucleus, to the form 

(c -I- ^ (j;*) =:(V/rel + Vs.o.^_ l/cont _L V'mag) Q;. (20.26) 

where and Vcont are given by Eqs. (19.15), (19.23) and 

(19.24), respectively. Upon averaging of these terms 

AE„, = ij ('F, *F,) + KS.O.-1- ) (Ij;'') d'x (20.27) 

we obtain the fine structure formula (20.16), that isj° 

^E „,==- _ IV (20.28) 

When a magnetic field is present, we obtain on the right-hand 
side of Eq. (20.26) the interaction 

(— i + ai] , (20.29) 


^^Generally speaking, the spin-orbit interaction is of fundamental importance in 

this case. Since, however, the relativistic terms are of the same order as the spin-orbit 
interaction, we may set 


AE 


n; 


s o 


- AE, 
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which gives the following value for the additional energy of the atom: 

A/rmag^ J (iITMir,;) j J j cPx. (20.30) 

The appearance of either anomalous (case of a weak magnetic field) 
or normal (case of a strong magnetic field) Zeeman effects depends 
on the relative proportion between the additional energies on 
the right-hand side of (20,26). 

Let us assume that we have a comparatively weak magnetic 
field, whose interaction with the atomic electrons is smaller than 
the relativistic or spin-orbit interaction. 

Then the zero-order approximation will be expressed by the 
wave functions (20.2) that are obtained when the spin- orbit coupling 
is retained. 

Substituting these functions into (20.30), the additional energy 
becomes 


A£ mag_ J 1 1 ^ dr I dQ (- / + a') Yy^. (20.31) 

() 

In (20.31) we should note that the integral over r is equal to 
unity 


f (20.32) 

b 

Substituting in place of the spherical spinors their values from 
(18.24) and (18.25), and using the orthogonality condition for the 
spherical harmonics 


we find the following expression for the additional energy when 

+ V'2* 

A£niag == [(/ 1 _ _ i)j _ 

In exactly the same way when }=l — Va we obtain 


[(/ _ m+ 1) m + (/ + m) (m - 1)] = 

= (m — Vs) 27 ~FT' 
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Recalling that = the last two expressions may be written 

as a single formula 


^E^^S=zlLf^oW*gmJ — ohgmj, (20,33) 


where o = ^-— is the Larmor frequency, and theLandeg- factor is 

(20.34) 

Thus, in the case of the anomalous Zeeman effect, the expression 
for the additional energy contains the Lande g factor, which in the 
case of the normal Zeeman effect [see (16.23)] is equal to unity.. 

The additional energy (20.33) does not lead to the usual triplet 
splitting (normal Zeeman effect), but to a more complex splitting 
pattern (anomalous Zeeman effect). 

In view of the fact that trij can assume 2/-I-I different values, 
each level in the case of the anomalous Zeeman effect is split 
into 2/+ 1 separate sublevels; that is, the external magnetic field 
completely removes the degeneracy, which is present even in the 
relativistic theory of the hydrogen atom. 



Fig. 20.4. Zeeman effect: a) position of the energy levels 
in the absence of a field, h) anomalous Zeeman effect, 
c) normal Zeeman effect 


To obtain the splitting pattern, it is necessary to take into 
account the value of the Lande g factor ( g — 2 for the si/^ states , 
g — Vs for the Pi/., states, g = */'3 for the /?j/. states, and so on) and 
also the selection rules for the magnetic quantiun number nij. In 
particular, when A/iiy = 0, the emitted components are polarized 
parallel to the z axis (that is, parallel to the magnetic field), and 
when Ar?ij = ±: 1 the components are polarized perpendicular to the 
magnetic field. 
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Equation (20.33) gives us the following value for the frequency 
of the radiation: 


OJ — 0 )„ 0 — grtij), (20.35) 

where is the frequency of the radiation in the absence of a mag- 
netic field ( J/" — 0 ); g' and g are the Lande g factors of the initial 
and final states; the magnetic quantum number nij of the final state 
may take three values: nij ni] • 1 . 

Figure 20.4b shows the splitting of the spectral levels F'si . and 
27 ;! . in a weak magnetic field, the Laixuor frequency being taken 
as the unit of the splitting. From Fig. 20.4b it is seen that in this 
case there are four, and not three (as in the normal Zeeman effect) 
shifted lines. The magnitude of the displacement is given by 
(20.35). 

In the case of a weak field [according to (20.34)] we find 

Hence 


/^(Uj — “j- 0^ \ii), ^ 0, 

Atuj — I 0, Aioj 0 (20.36) 

Equation (20.34) for the I ande j[| factor is applicable to the hydrogen 
atom and to atoms having a single valence electron. In the general 
case, the Lande g factor becomes 


I I f L ^ f n f- r L 


(20.37) 


where L, S, and J are the orbital, spin and total angular momenta 
of the atoms and 


/ = |L±5|. 

In particular, for elements of the first group (/ = /, L = ly 5 = V 2 )» 
Eqs. (20»37) and (20.34) are identically the same. The Lande g 
factor attains its maximum value for s states (1 = 0, j=:s=^l^): 

gs = 2 (20.38) 

For atoms with two electrons in the outer shell (for example, 
helium atoms), single lines (5 = 0, J = L) are possible along with 
the triplet state 5=1. For the single lines we have 1, and, 
therefore, in this case spin effects should be of no importance | 
only the normal Zeeman effect (that is, triplet splitting) should be 
observed in either a weak or strong field. 
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F. STRONG MAGNETIC FIELDS. PASCHEN-BACK 
EFFECT 

It has been indicated that the anomalous Zeeman effect appears 
in the case of weak fields, when the external magnetic field cannot 
disrupt the spin-orbit coupling. 

Mathematically this means that A/i^®g [see (20.33)] is much 
smaller than the natural splitting of the lines given by Eq. 

(20.28) 

AEtt^ag (20.39) 

In the latter case we first solved the problem by taking into account 
the spin-orbit interaction; this establishes a relation between 
the spherical harmonics that form the spherical spinor (18.24) or 
(18.25); then we found an additional energy that leads to the 
anomalous Zeeman effect, since the Lande factor does not equal 
unity. 

In the case of strongfields, when the splitting due to the external 
magnetic field is greater than that due to the spin-orbit interaction 

AE»Tia}^ -AEs-^ (20.39a) 

the magnetic field “breaks** the spin-orbit coupling and the zero- 
order approximation solutions, expressed in terms of spherical 
spinors [see (18.24 and 18.25)], are no longer true. 

In this case we may neglect the interactions V'*’*^* and 

I'cont (20.26), which, when (20.29) is taken into account, Ixicomes 



Inio 





(20.40) 


Using the fact that the functions 'F, and must be proportional to 
the spherical harmonics Y'", with — i j-y™ = mY^, we find two 
independent equations for these wave functions; 




(20.41) 


from which it is evident that the additional energy equals 

(m + 2m,). (20.42) 


and 


A (A£„^)mag= (a,„ + 
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that is, the wave fiinction corresponds to the case in which the 
electron spin is directed along the magnetic field (mj = VO* 
wave function 'i'a corresponds to the case in which the spin direc- 
tion is opposite to that of the magnetic field. 

If we choose m such that the same energy value is obtained 
for both functions, then we must set — 1 for the function'll , 

and m .2 — m + 1 for the function 
In this case the wave functions 


= (20.43) 


will be mutually orthogonal, so that 

|j (20.44) 

0 

Since the interaction between the atom and the external magnetic 
field (20.29) contains only the matrix <?'„ which does not couple the 
wave functions 'i*, and 'F>, transitions from the state with m,= V ‘2 to 
the state with m, » induced by this interaction, will be for- 

bidden in this case and hence A/ 71 ^ — 0 

Taking into account this circumstance, and also the selection 
rules for the quantum number tn — :L 1 ), we find from (20.42) 

an expression for the Zeeman splitting of the spectral lines 

A(u=:oAm~0, ±:o, (20.45) 

which agrees with the result of Schrbdinger^s theory, which explains 
the normal Zeeman effect (triplet splitting of the spectral lines). 

Thus, in strong fields the anomalous effect 

becomes the normal effect, which is in agreement with experi- 
mental data (Paschen-Back effect). It is interesting to note that 
the passage from the anomalous Zeeman effect to the normal effect 
can be illustrated by Fig. 20.4, if the Lande fy factor is set equal to 
unity (see case c). Then the splitting will be 

Atoj == Awj — 0, A(o,^ = o,and Ao)^ = — 0 ; 

that is, we obtain three components of the split line Instead of four. 

In special cases, when AE®*°V:; A£“®® for one energy level and 
conversely A£s*o*> aE "^®8 for the other level, or when the energies 
of both levels are of the same order of magnitude, the Zeeman 
splitting becomes complex. Since these are all special cases, we 
shall not elaborate them here. 



THE FINE STRUCTURE OF SPECTRA 


331 



roblem 20. 1 J Investigate the diamagnetism and paramagnetism of atoms by placing 
them^Tfn^WBSfant homogeneous magnetic field (//^ 
investigation of the Zeeman effect, keep all terms conti ning g^as well as terms pro- 
portional to < 2 %^* in the Hamiltonian [see (20.25)]* 

Indicate the atoms in which diamagnetism may be observed. 

Solution, When terms portlonal to e^^stre taken into account, we have, instead of 
(20,25) 


pa 

2Wo 


2mo 




Consequently, on the right-hand side of (20,26) we have another term 


rdiam 


8/;inC^ 




which in conjunction with (20,33) gives the following expression for the additional energy 
of an electron in a magnetic field: 


= (20.46) 

Here when calculating the perturbation energy proportional to we have used the 

spherical symmetry, which must occur in the zero-order approximation "=0), and 
have set 


J >ii‘+(^»+y)rd>A-= I J 'rir=u 

Hence the magnetic moment of an atom in a magnetic field is 
lA/-;'"® 

- r»s..v 6,„^, 


rf’.l = 

o 


atom param t diam 

F = “ - Vogm, • (20.47) 


The latter relationship is a generalization of a familiar equation relating 
in the case where A/^niag jg a nonlinear function of , . 

The diamagnetism^ of atoms is characterized by the second term of Eq, (20.47), which 
is proporuonal to 

The magnetic susceptibility per gram-atom is 


.^diam 




& " 


ylN_ pr 


(20.48) 


where N is Avogadro*s number, 

'Hie quantity /.diam ig never zero and must always be negative (^“',^*0). 'Iherefore, 
the diamagneuc effect must occur in all atoms. 

As for the first term on the right-hand side of Eq, (20.47), which is proportional to 

my, it may take either positive or negative values, since ^ ^ ^ 1 ~ A 

In a state of thermodynamic equilibrium, however, the negative values for my, which 
give a smaller value for the energy, will be preferred. On the average we therefore 
obtain a positive value for the paramagnetic susceptibility ‘ * 

..param___ /(y + 1) 

~ kT 3 


Expression (20.47) is obtained for a weak magnetic field, when the anomalous Zeeman 
effect occurs. It can, however, be easily extended to the case of a strong magnetic field. 
To do this we must set 1, = m in (20,47) [see (20,42)]. 


1 1 


See R Becker, Electron Theory. 
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Since the paramagnetic susceptibility is considerably larger than the diamagnetic 
susceptibility (XP&^am ^ 7 dlam)^ the atoms exhibit paramagnetic properties when 77 ^ 0 * 
For hydroger-like atoms } differs from zero (the minimum value of / equals Vs) and, 
therefore, they are always param^netic. Only for atoms with an even number of 
electrons can the quantum number / vamsh (for example, parahellum in the ground state, 
see Chapter 24), Such an atom will be diamagnetic. 


^Prob lem 20.^ Adding geometrically the orbital and spin angular momenta, show that 
the^fBWwrtWK^Zeeman effect is associated with the fact that the total magnetic moment 



Fig 2() 5 Goometric interpretation of the 
anomalous Zeeman effect 


|i is not parallel to the total angular 
momentum J, With the aid of the geo- 
metric model, also explain the Paschen- 
Back effecL 

Soluuon. First let us find geometri- 
cally the angular and the magnetic mo- 
ment vectors 

J = L + S, , = -.^^(L + 2S). 

In the geometric representation (see 
Fig. 20.5) we may choose the scale so 

that u = — TT--— (it IS immaterial at 

‘ 2moC 

the moment whether the vectors are 
parallel or antiparallel). 

Tlien 

|i/ L, and ji^ = 2S; 


that IS, the total magnetic moment |t will undergo two rotations in a magnetic field: one 
with an angular velocity around tlie direction of the total angular momentum (this 
angular velocity corresponds to the frequency associated with the transition between 
components of the spin-orbit splitting of the spectral lines <0 ^ and the other corre- 
sponding to the Larmor frequency of precession around the direction of the magnetic 
field H (//, -= //^ = 0, U, = 

When (D >* 0 , the additional energy should be calculated from the relation 


mag „ * J ) (J * H) 

Since the magnetic moment is directed on the average along J, we have 

^ g^hmjy 

where 0 = - , and the Lande g factor is equal to 

ZIHqC 

I S 

^ cos (Z- *7) + 2 -J cos (5*y). 


Substituting the values of the cosines of the angles 

cos(S.y) = — . cos(t-y)= — 2 ji7 


and remembering that js— * i)f and so on, we obtain the expression (20.37) for 

the Lande g factor. Let us pay attention to the fact that if the vectors ji and J were 
parallel, the Lande g factor would be unity. 
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In strong fields o > <o we must consider independently the rotation of the orbital and 
the spin moments \i around the z axis. Then the additional energy becomes 

i£-mag 3, _ (L. + 2S,) = oft (m 4- 1), 

which leads directly to the normal Zeeman splitting (Paschen-Back effect), 

/Problem 20.^ By means of the relativisuc scalar wave equation and the Dirac equa- 
tioil, llIH! 'tllS 'frequency of the allowed transitions between the /z =-- 2 and;z = ;i states. 
Show that, according to the Dirac theory, there are seven lines, five of which are distinct, 
and that, according to the scalar theory, there are only three disanct lines. 



Chapter 21 


The Effect of Nuclear Structure 
on Atomic Spectra 

A. INTRODUCTORY REMARKS 


As has been mentioned in Chapter 13, the position of the spectral lines is shifted when 
the finiteness of the nuclear mass is taken into account. The Rydberg constant R in the 
expression for the energy of a hydrogen-hke atom 


„ Rnz^ 

is somewhat reduced and becomes equal to 


( 21 . 1 ) 


( 21 . 2 ) 


where 


^co 

is the Rydberg constant corresponding to infinite nuclear mass. Consequently, the 
Rydberg constant will have somewhat different values for hydrogen, deuterium, and 
tritium. With the great accuracy attainable in modern spectroscopic techniques, this 
effect can be used to detect the presence of different isotopes (see Chapter 13). 

In a similar fashion, the finite size and the magnetic moment of the atomic nucleus 
have certain effects on atomic spectra. 


B. EFFECT OF THE FINITE SIZE OF THE NUCLEUS 

When the motion of an electron in the field of a nucleus is treated as a problem in 
classical theory. It is quite immaterial whether the nucleus is regarded as a point or as 
a particle with finite dimensions. All that matters is that the electron should at all 
times be outside the nucleus and that the nuclear charge should be spherically sym- 
metric, since the potential outside a spherically symmetric charge distribution is the 
same as the potential of a point charge. 

In quantum mechanics the situation is somewhat different. The wave function must 
differ from zero inside the nucleus; therefore, there is a certain probability (however 
small) that the electron will be located inside the nucleus. Consequently, the charge 
distribuQon inside the nucleus must in some manner influence the energy levels of the 
electrons in the atom. 

To estimate the effect of the finite nuclear size on the energy spectrum of a hydrogen- 
like atom, we shall assume that the nucleus can be represented by a sphere of radius 
with charge distributed uniformly throughout the volume. The potential energy will 
be given by (see Fig. 21,1) 




Zej 

Rj^ 


( 


2 


R% 


for r < R^r, 


(2U) 
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K = for (2U) 

The shift of energy levels due to the finite size of the nucleus can be calculated with 
the help of perturbation theory. We shall assume that the perturbation energy consists 
of the difference between the potential energy of a point nucleus and the potential energy 
of a nucleus with charge uniformly distributed over the nuclear volume 





± iL 

2 



<21.5) 


With 


c 



I 

0 


for r < V, 
for 


The perturbing force therefore differs from zero only inside the nucleus. 


( 21 . 6 ) 



the variation of the potential energy 
which would be obtained inside the 
nucleus, if the potential were de- 
scribed by the Coulomb law both inside 
and outside the nucleus 


In first-order perturbation theory the shift of the levels is given by 
AEvol ^ ^*vol 


(21.7) 


Since I <1^ does not change appreciably in the region r ^ this integral can be readily 
evaluated by substituting for { (}< j> its value at the origin. 


^E 


vol 


% 




( 21 . 8 ) 
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Substituting the expression (20. 14) for \ <}/ (0) p , we obtain 


"“5 aln' * 


(21.9) 


that IS, in nonrelativistic theory, the shift of the energy levels 
for s states (/ = 0;. 

It can be shown that for p levels this shift will contain 

ti* 

(where 0 ^ — 00 ^ ), and for d states a factor of the order 


IS different from zero only 

( R Y 
/ 

of ( '^1 , and so on.^ Con- 


sequently, the shift in the energy levels for p and d states can be neglected in the first 
approximation. 

For hydrogen, the first-order shift in the energy levels is about 1 Me; this is much 
too small to account for the Lamb shift, which Is equal to approximately 1,057 Me (see 
Chapter 20), 

The volume of the nucleus is important in connection with the isotope shift, that is, 
the shift in the energy levels of atoms with the same atomic number Z and different 
mass numbers 71. The chief factors that give rise to the isotope shift are the different 
masses (the mass effect) and different volumes (the volume effect) of the isotopes. The 
mass effect is manifested in a shift of the spectrum lines towards the ultraviolet as the 
mass number A increases. For example, for Z — 1 the highest frequencies are found 
in tritium with A == 3. then deuterium with A = 2 , and, finally, ordinary hydrogen with 
A = 1 [see (21,2) and’ also Chapter 13], On the other hand, the volume effect is mani- 
fested in a shift of the spectrum lines towards the infrared as .4 Increases, For instance. 
It can be seen from Eqs. (21,1), (21.9) and (21,13) [see below], that the energy levels of a 
hydrogen-like atom will be given by the following expression when the shift due to the 
volume effect is taken into account: 


l--n 


11“ \ 5 aln 


Experiment shows that an isotope shift towards ultraviolet is observed in elements whose 
atomic number Z is less than 40-50. For elements with a larger value of / an Isotope 
shift in the opposite direction is observed, that is, towards the infrared. This indicates 
that, for relatively light elements, the isotope shift is caused mainly by the mass effect, 
whereas for heavier elements it is caused by the volume effect. This, however, is a 
rather simplified picture of the isotope effect, and other features associated with the 
structures of the atom (for example, nuclear spin and polarization of nuclei by electrons) 
also have to be taken into account. 


C. MESIC ATOMS 


The finite size of the nucleus has a particularly important effect on the position of 
energy levels in a meslc atom—an atomic system consisting of a ;i meson revolving about 
a nucleus. The p meson is a particle that has the same spin as the electron (that is, spin 


^To calculate the shift of the p levels we must substitute the second ♦^erm m the ex- 
pansion of |iA|(r) into (21 7). namely, 

since the main term |>/a( 0)P vanishes for p states 



THE EFFECT OF NUCLEAR STRUCTURE ON ATOMIC SPECTRA 


337 


1/2 In units of fl) and a mass 207 times greater (//i,jl = 207/Wo), so that tlie jx meson is 
basically a "heavy" electron, Mesic atoms can be produced by passing negative p mesons 
through matter. After losing its energy and slowing down, a p meson may be captured 
In an orbit about a nucleus, forming in this way a p-mesic atom, Mesic atoms have 
been obtained for almost all elements of the periodic system, from hydrogen up to 
the heavy elements (uranium, neptunium, and so on)/ 

Tlie motion of a p meson about the nucleus is determined mainly by the Coulomb 
attraction, just like the motion of an electron, A p meson, however, also has nonelectro- 
magnetic interactions with the electron— neutrino and nuclear fields; these interactions 
may result in spontaneous decay of the p meson into an electron, neutrino, and antineutrino 
(the lifetime of a p meson at rest is t = 2,2 x 10”^ sec). Tlie p meson has, in addition, 
a definite probability of being captured by a nucleus, Ihus the lifetime of a p-meslc 
atom is determined by two competing processes: natural decay of a p meson into an 
electron, neutrino, and antineutrino and nuclear capture of the p meson. In light mesic 
atoms ( Z 10) the probability of the first process is greater than the probability of 
the second; that ls> the lifetime of a mesic atom is determined by the lifetime of a 
p meson at rest (t^ 10““ secX For Z > 10, nuclear capture begins to predominate 
and the lifetime slowly decreases to 7 • 10 * sec (for Z = 82)/ 

In the dieory of p -mesic atoms, ordinary electrostatic interaction plays a fuiida- 
mental role. In the first approximation, we can regard the nucleus as a point charge 
and calculate the energy of the mesic atom and the radius of the orbit using the equations 
derived for an ordinary hydrogen-like atom, replacing the electromc mass by the mass 
of a p meson. Then the energy and radius of the orbit will be given by [see (13.33), 
(13,45)1 




ft* 




z » 


( 21 , 10 ) 


where m^ = 2(ilmQ , it can be seen that the energy of the p meson in a mesic atom is 
207 times greater than the corresponding energy of an electron in the atom, and that the 
radius of the orbit on the contrary is reduced by the same factor. If electrons remain 
in the atom along with the p mesons, they will move about the nucleus in considerably 
larger orbits than the p meson and therefore cannot exert a significant influence on the 
p meson rotating around the nucleus, A mesic atom, therefore, may be regarded as a 
hydrogen-like atom that can have both large and small values of Z. 

Since the radius of the "Bohr" orbit of the meson is 207 times smaller than that of 
the electron orbit, the probability that the meson will be located in the nucleus is con^- 
siderably greater than for an electron in a hydrogen-like atom. The main correction to 
the energy levels of a mesic atom will therefore come from the volume effect. The 
equation for the energy of s states can be obtained from (21,9) and (21,10) by replacing 
the Bohr radius Oq by the corresponding radius in a mesic atom 


a 


(j. 


ft* 


( 21 , 11 ) 


obtaimng, therefore. 


^lU /-o — 


/, _ i. 

\ 5 


Z>R),\ 


( 21 . 12 ) 


^These subjects are treated in more detail m a paper by D D Ivanenko and G E Pus- 
tovalova, Vspekhi fizicheskikh nauk, 61, 27 (1957) 

■^For 77 mesons, which strongly interact with nuclei and are responsible for nuclear 
forces, the lifetime with respect to decay into a /x meson and neutrino is equal to 2 6 X 
10 * sec, whereas the lifetime with respect to capture by a nucleus is many times smaller 
In particular, for 7r-mesic hydrogen the capture time of a negative pion from the orbit is of 
the order of 10 ^ ^ 


sec 
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Using the fact that the nuclear radius is related to the mass number A'^2Z by the 
expression 


where Ro = const, we obtain in the first approximation 


En, i-o = — 




1 

5n 



(2U13) 


(2U14) 


It follows that, for the s levels, the energy correction due to the nuclear volume will 
be proportional to • and therefore attains very large values for heavy elements. 

In heavy elements, the orbit of a p meson may even be inside the nucleus, at large 
^ and small n, Ihe Bohr radius of the mesic atom becomes equal to the nuclear radius 

for Z Z j* rrr 45 

For orbits inside the nucleus > Z^j .) , the main part of the potential will no longer 
be determined by the Coulomb law (21,4) but instead by formula (2L3) which corresponds 
to the potential of a three-dimensional harmonic oscillator (on the assumption of a 
simplified model of the nucleus in which the charge Is uniformly distributed over the 
volume), Ihus tlie energy has to be determined from the following equation instead of 
(13.4): 


d^‘R 

dr^' 


+ 2 

r 


dR 

dr^ ■ 


N 


Ze^ 


./(/ 


0 . 


(21,15) 


In Eq, (21,15) let us change the variable by setting R= }Z rR\ r= p. Since 




= 4pV. J 




it follows that the equation for the energy of a three-dimensional harmonic oscillator is 
formally identical with the equation for the hydrogen-like atom 


</p2 


, 2 dR' 

+ 7 -rfp- 




but has different values of the constants, namely^ 


(21,16) 
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(21.17) 


To determine the eigenvalues of the energy we may use Eq, (13.20), according to which 


Subsatutlng for B, A and /' their values from (21,17) and using the fact that k=^n- / - 1, 
we can find the energy of the meson in an orbit inside the nucleus: 

Eni==--Vo-iri<,{2k 


(21.18) 
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where the frequency of mechanical vibrations of the three-dimensional oscillator is 





Zel 


The quantity represents the zero-point energy of the three-dimensional oscillator; 
It is three times greater than the corresponding zero-point energy of a one-dimensional 

3 Ze^ 

oscillator* The quantity Ko= tt ^ - is the greatest depth of the oscillator potential well. 

Equation (21*18) is correct on the assumption that the potential energy varies in ac- 
cordance with (21,3) from zero to infinity. If the finite size of the nucleus is taken into 
account, an additional energy is obtained that represents the difference between the 
potential energy of the particle in the oscillator well and its potential energy in the 
Coulomb field (this difference being averaged over the space outside the nucleus X The 
equation for the additional energy Is 


A£ vol = Ze 


sf 


rfQ J H-p 


Rj^ 


/3 


2%] r 


r-dr. 


(2U19) 


Since the wave function of a spherical oscillator is similar to the wave function for a 
Coulomb field (it is determined from the same wave equation with r replaced by P = r*}, 
it decreases exponentially as r increases. The 

correction (21.19) is therefore significant only — ,.i. , , ■■ 

when the radius of the mesic atoms is close 
to 

As an example, let us consider the ?p Is 

transition in lead {Z = 82), If the mesic atom - — — 

of lead is assumed to have a point nucleus, the 
energy released in this transiuon can be found 
from (21.10) 


■ 

■ 2 P ,/2 


Am /‘V’ 

E, - /:\ = M Mev. (21.20) 

The relativlsac and spin effects in the - 
the amount ^ 


l ie 21.2 


Energy levels in 
atom 


a mesic 


transition increase this energy by 


A (A/:) ^2 Mev. (21.21) 

Incidentally, such a significant role of the relativistic and spin effects is due to the fact 
that the energy Is expanded in terms of (Zoi)®. This quantity is comparatively large for 
lead (Z= 82, a= 1/137X Comparison with experiment shows, however, that in this 
transition an energy of 6 Mev is liberated instead of the predicted 16 Mev, This dis- 
crepancy between theory and experiment arises because the 1 s state for lead lies inside 
the nucleus. In the Ip state the orbit lies outside the nucleus and the volume effect of 
the nucleus is small. If we take the energy of the Is level from (21,18), and the energy 
of the 2p level from (21,10) for a point nucleus, the energy of the 2p - ► Is transition 
will be 3,6 Mev, If we add to this the correcuon (21.19) for tlie energy of the Is level, 
the energy of the transition will come to about 5 Mev, which is relatively close to the 
experimental value, 

A study of the multlplet structure of the '}p level in mesic atoms enables us to 
determine the spin of a p meson. For a particle with integral spin, the level splits 
into an odd number of components (for spin 0 no splitting occurs, for spin 1 three lines 
are observed, etc.X Since the 2p level splits into two components and 2p8/., ), it 

was established that the spin of a p meson is 1/2, On the assumption of a point nucleus, 
the theoretical splitting calculated from Eq. (20,17) should amount to about 0.55 Mev, 
with the 2px/jj and 2s i/^ levels coinciding. When the finite size of the nucleus is taken 
into account in a mesic lead atom, the splitting of the 2/7«/3~2/7i^ levels is reduced 
to 0.2 Mev and the 2si/^ level is raised above the 2/?,/ level. This is illustrated in 
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Fig. 21.2. From the above data It follows that heavy me sic atoms will emit gamma 
quanta having energies of several Mev. Lighter meslc atoms emit x-rays. 

Because of the significant influence of the size of the nucleus on the spectra of heavy 
meslc atoms, the charge distribution inside the nucleus can be determined from an 
analysis of the spectra. It has been found that the value that should be substituted for 
Rq in the formula for the electromagnetic radius of a nucleus with mass number A: 

( 21 . 22 ) 


is 1.2 X 10-13 cm rather than 1.4 x 10-13 cm (the value assumed for nuclear inter- 
actions). 

Similarly, the multlplet structure of the spectral lines of me sic atoms can be used to 
determine the magnetic moment of the p meson; its value is close to the muon magneton 






(21.23) 


where the Landc g factor Is 

Tlie theory of ji -meslc atoms is based mainly on electromagnetic interactions. By 
contrast, in the theory of 7 r-mesic atoms, a great part is played by the nuclear Inter- 
actions, the theory of which is far from complete. Further experimental study of 
TT -meslc atoms and an explanation of the semiempirlcal laws that describe their be- 
havior will have important bearings on future work in the theory of nuclear forces. 
These topics, however, lie beyond the scope of this textbook. 


D. APPIJCATION OF THE DIRAC EQUATION 
TO THE NEUTRON AND THE PROTON 


The Dirac equation describes the motion of particles with spin 1/2, It applies to 
electrons as well as to protons and neutrons. In the presence of an electromagnetic field 
it is necessary to take into account the charge of the proton, as well as the so-called 
anomalous magnetic moments of tlie proton and the neutron. We recall that the energy 
of interaction between a charged Dirac particle and an electromagnetic field is 

<x*A . (21.24) 

In the nonrelativistic approximation this expression contains a magnetic moment due to 
the intrinsic (spin) angular momentum (h '2o) 

(2U5) 


This quantity is known as the kinematic or Dirac magnetic moment. In passing to the 
relativistic equation, we must replace the mass //Join Eq. (21,25) by its relativistic value 

//I,, 

Vl therefore, the Dirac magnetic moment vamshes as the velocity approaches 

the velocity of light (i; r). 

In addition to the Dirac magnetic moment, which appears only in the nonrelativistic 
approximation and which depends on die charge, a particle may hf ve an anomalous 
magnetic moment that does not vanish even in the relativistic case and is independent 
of the particle's charge. 

We sliall now find the energy of interaction due to the anomalous magnetic moment. 
The energy of interaction (21,24) of an electron with an electromagnetic field is a 
scalar quantity, since in four-dimensional space = \y~A^, A;;~Az» 

In the same wav the unit matrix I is the fourth component of the velocity matrix a 
(that is, 04 = /!)/ 'The interaction energy (21,24) may, therefore, be represented as a 
scalar quantity in four-dimensional notation 


4 

= (21.26) 

, 1=1 


‘*More precisely, the quantity fsee (17 32)1, where "^1,2,3,^ - d, will 

transform as a four-vector 
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The electromagnetic field forms an antisymmetric tensor of second rank 


where 

_d/l, 

dv, ’ 


(21,27) 


A-4 = ict. 



It follows that 

//v=//.., 

iE,=:I1u> 

(2L28) 


The interaction energy of the anomalous magnetic moment with the electromagnetic field 
is, therefore, given by 


y (21.29) 

JJ., V-=-l 

where is a second rank tensor composed of the Dirac matrices,*' 

Using the rules for transformation of a wave function under the Lorentz [see (17.38)] 
and spatial rotauons [see (17.39)], we can show that the quantities 


Pa’ll 


“ai — pa’i-’i 
«4'j = — 


aj3 — pjQa, 

^u~ — ipAi 




(21.30) 


are the matrix elements forming a second rank tensor. The energy of interaction between 
tlie anomalous magnetic moment and the electromagneDc field takes the form 


+ (21.31) 

An electron has a charge, a spin, and also a Dirac magnetic moment. Its anomalous 
magnetic moment is relatively small (see below), A neutron has no charge, but it does 
have an anomalous magnetic moment; this magnetic moment determines the interaction 
between the neutron and the electromagnetic field. As for the proton, it has both a 
charge and a spin, and hence a Dirac magnetic moment; in addition, it has an anomalous 
magnetic moment. It should be noted that nuclear interactions are of great importance 
in the theory of nucleons. 

E. EXPERIMENTAL DETERMINATION OF THE MAGNETIC 
MOMENT OF THE NEUTRON AND THE PROTON 


The procedure for determining the magnetic moment of the neutron, proton, and 
complex nuclei is basically the same as for the magnetic moment of the electron (the 
Stern^Gerlach experlmentX The basic principle consists in applying a magnetic field 
perpendicular to the direction of motion of the particle. The particle will react dif- 
ferently depending on whether its magnetic moment is oriented parallel or antiparallel 
to the field. 

Let us first consider the possibility of determining the Dirac magneuc moment 
and the anomalous magnetic moment of a free particle. Suppose a free particle moves 
perpendicularly to the z axis. The Hamiltonian describing its motion has the form 


H = cf,Cj,Pj, + cpiO^Pj, -|- ps/;ioC=. (21.32) 

The component of the intrinsic angular momentum perpendicular to the direction of this 
motion 


5 z = -2 


(21.33) 


s 


More precisely, the quantity 




IS a second rank tensor 
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does not commute with this Hamiltonian* The component of the total angular momentum 
along the z axis does commute with the Hamiltonian 

pjif "2 (2 1*34) 

and can therefore be determined exactly together with the energy* 

Let us evaluate the error in the determination of the orbital angular momentum by 
means of the uncertainty relation* If the origin of the coordinate system Is taken to be 
at the center of the wave packet, the error in the orbital angular momentum will be 

^y^px — ^x^py. 


In accordance with the uncertainty relation, we have ^Px ^ ^ , ^py 


ft 

-r- « Since the 


errors may be either positive or negative, we find 

The error will be minimum when | Ax | = | Ay ]* Thus the error A due to the 

translational motion of the particle is of the order of the spin, and therefore the per- 
pendicular components of the Intrinsic angular momentum and the Dirac magnetic 
moment cannot be determined simultaneously* 



Fig. 2 1 3 Experiments for determining the magnetic 
moment of the neutron 

We recall that the Stern-Gerlach experiment allowed for the determination of a mag- 
netic moment of a bound electron; since, however, the orbital angular momentum in the 
s state is zero, it was the spin (or Dirac) magnetic moment that was actually measured* 
In accordance with (21.31), the Interaction energy associated with the perpendicular 
component of the anomalous magnetic moment is 


— PPi®8 • (2 L 36) 

'This component commutes with the Hamiltonian (2L32) and can therefore be measured 
exactly* Consequently, it is possible to measure the magnetic moment of a free neutron 
when the magnetic field is perpendicular to its motion, as was done by Bloch and Alvarez 
(1940),** In their experiments when a beam of neutrons was passed through a piece of 
magnetized lron,7 the most pronounced scattering was observed for those neutrons whose 
magnetic moment was parallel to the magnetic induction vector inside the iron* The 
emerging beam consisted, therefore, mostly of neutrons whose magnetic moment was 
antiparallel to the magnetic induction vector. If the neutron beam passes now through 
two magnetized iron plates in succession, the experiment is completely analogous to 
the transmission of light through two Nicol prisms; that is, the first Iron plate acts as 


^’Similarly, it may be shown that only the longitudinal component of spin commutes 
with the Hamiltonian In principle, therefore, it also can be measured experimentally 

Neutrons have no electric charge and they pass quite freely through matter Actually, 
their only interactions occur on collisions with the nuclei. 
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a polarizer and the second as an analyzer. This phenomenon was used to determine the 
magnetic moment of the neutron. 

A schematic diagram of the Bloch- Alvarez experiments is given in Fig. 21.3, 

Unpolari 2 :ed neutrons moving along the direction of the a' axis pass through the 
polarizer (first iron plate, with a magnetic induction vector directed upwards). 'The 
emerging beam consists mostly of neutrons whose magnetic moment is directed down- 
wards. These polarized neutrons will pass freely through the analyzer if its magnetic 
induction vector is directed upwards, like that of the polarizer. On the contrary, they will 
be transmitted much more weakly if the iron plates are oppositely magnetized. 

Between the polarizer and analyzer there is a device that reorients the neutron 
spin. This device is similar in principle to the one used by Rabl in his nuclear magnetic 
resonance experiments to determine the magnetic moment of the proton and of heavier 
nuclei. The basic principle of the Instrument is as follows. In the space between the 
polarizer and the analyzer, a relatively strong, constant magnetic field is applied 
parallel to their magnetization vectors (see Fig. 21.3). A neutron whose magnetic 
moment is antiparallel to this magnetic field acquires the additional energy 

A£anu=n„69f. (21.37) 

If, however, the magnetic moment is parallel to the field, the neutron loses an energy 

4/: par (2U8) 

In addition, there is a relatively weak oscillatory magnetic field II applied perpendicu- 
larly to the field 


= A cos 

This oscillatory field will reorient the spin of the neutron particularly strongly when the 
frequency wq is close to the resonance frequency 


f'o == 


^jr;anti.. ^/fpar 

h 


n • 


(21.39) 


At this resonance frequency, the number of neutrons undergoing a reorientation of the 
magnetic moment reaches its maximum value. To find when this happens, it is necessary 
to determine when the number of neutrons passing through the analyzer is minimum in 
the case of parallel magnetic Induction vectors, or maximum in the case of antiparallel 
magnetic vectors. 

Once the frequency ( 0 (, has been determined (it is, in effect, twice as great as the 
Larmor frequency of precession), the magnetic moment of the neutron can be found. 
According to recent data it is equal to ® 

(!„=-• 1.9l31|Jnuci 

where the unit for measuring magnetic moments is the nuclear magneton 


'‘nucr2';t = S 


. tlh 


where nip is the mass of a proton and /a® Is the Bohr magneton. 

From the resonance frequency of the oscillatory field, we can determine the magnitude 
of the magnetic moment but not its sign. If, however, we replace the oscillatory magnetic 
field by a rotating magnetic field, we can also determine the sign of the moment, since 
for resonance it is necessary that the vector equation ©o = — 2o should hold, where 

0 =- is the Larmor frequency of precession of the neutron spin. The minus sign 

shows that the magnetic moment of the neutron. Just as for the electron, is directed 
opposite to the spin. 


^Subsequent Improvements on the Bloch and Alvares experiments are described in 
E. Segr^, Experimental Nuclear Physics, Vol. I, 1953. 
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The resonance frequencies at which the magnetic moment of a neutron, proton, or 
other nucleus is reoriented in a field 10* oersted are In the microwave region. 

From (21,39) these resonance frequencies will be of the order of 10“ Me (this corre- 
sponds to a wavelength of several meters). 

The magnetic moment of the proton can be found only in a bound state since the proton 
is a charged particle and therefore has a Dirac magnetic moment in addition to the 
anomalous magnetic moment. The experiments cannot be carried out with individual 
atoms, but only with molecules in which the electron spins cancel out. For instance, 
the magnetic moment of the proton can be found by applying the method of nuclear mag- 
netic resonance to the molecule. To obtain molecules with polarized protons, we do 
not pass them through magnetized iron (H 2 molecules do not pass through iron), but 
Instead split the beam by passing it through an inhomogeneous magnetic field in the 
same way as in the experiments of Stern and Gerlach, This method of measuring the 
magnetic moment of a proton was first used by Rabt (1934), even before the experiments 
of Bloch and Alvarez, 

The magnetic moment of the proton has a positive sign; that is, it is parallel to the 
spin. It is equal to + 2,7928 (in units of a nuclear magneton). It contains two 

components: one (fx^ = 1)^ corresponds to the Dirac magnetic moment and the other 
(^anom = 1.7928) to the anomalous magnetic moment. The anomalous magnetic mo- 
ments of the proton and neutron are approximately equal and have opposite signs. The 
magnetic moments of the deuteron + 0.8573) and other nuclei have been deter- 

mined by similar methods. 

One of the main results of the experimental determination of the magnetic moments 
of nuclei was the fact that they are of the order of the nuclear magneton, whereas the 
magnetic moments of atoms are of the order of the Bohr magneton, which Is (triplma) 
times larger than the nuclear magneton. This difference served Initially as a strong 
argument in favor of the proton-neutron model of the atomic nucleus, according to which 
electrons (which have a large magnetic moment) are not present in nuclei. 


F. HYPERFINE STRUCTURE OF THE 
HYDROGEN SPECTRUM 


I'he hyperflne structure of spectral lines is due to the Interaction between the mag- 
netic moment of the nucleus and the magnetic moment of the electron. If the nucleus of 
the hydrogen atom (Z = 1) has a magnetic moment Pp = jxpo' where 9p are the 
Pauli matrices acting on the wave function of the nucleus, it gives rise to a magnetic 
field 

4=Vx Ve, /f = Vx A. 

r 

This magnetic field acts on the magnetic moment of the electron p = — poo' (a* acts on 
the wave function of the electron), which results in an additional interaction between 
the nucleus and the electrons and leads to the hyperflne structure of the energy levels 


— ([iff) = (v X V X ^ j = 
=f‘oKp((a'-V)(9p-V)y-(»' I). 


(21.40) 


In the first approximation we may assume that there are no preferred directions and, 
hence, making use of the equations 

(9'-v)(0p-v)= -I- (»'•»;) (21.41) 


and 


V*y = -4«J(r), 


^This should properly read; /Xp**^ 


1 
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we obtain 

(21.42) 


Consequently, in the first approximation, the interaction of magnetic moments— Just like 
the contact interaction— influences only ±e s state. The expression in (21,42) 

can be found from the following simple considerations. 

The spin matrices of the proton aj, and the electron a' must satisfy the relation 

|ft«(a-+a;)» = n«S(5+l), (21.43) 


where 6' is the absolute value of the total spin, which is equal to either zero (antiparal- 
lell spins) or to umty (parallel spins). Then 

{(»'*+»;•+ 2 ( 9 '-»;) i = s ( s + i ). 

Using the fact that a'- = 6, we obtain 

(a'.a;,) = 25(S+ l)-3. (21.44) 

Since integration when the B function is present gives 

^,8(/-) = |^/(0)|», 


we obtain the following expression for the shift of s levels (hyperfine structure) of 
hydrogen: 


AC,.=:|-^,^,-^[2S(S+l)-31, 


(21.45) 


where 


flo — 


irhel 


is the radius of the first Bohr orbit, and the value of (0) | * is taken 


from Eq, (20,14), 

Two cases should be distinguished: 

(1) Spins of the proton and the electron are antiparallel (S = 0); then 






(2) Spins of the proton and the electron are parallel (S = 1); then 


(21,46) 


== 


8 1 
I — 3 I^P 


(21.47) 


The difference between these levels represents splitting of the s term due to the 
interaction between the electron and the magnetic moment of the nucleus 


Ao) 


n 


^ 1 

3 n h^al' 


(21.48) 


If we use (21,48) to calculate the s-term splitting for the case n = 1, then substituting 
the value of [ip obtained In Rabi*s experiments and setting {Xq equal to the Bohr mag- 
neton, we find 


Aco^*^‘*“^= 1417 Me. 

On the other hand, a careful experimental verification of the splitting of this level by 
microwave spectroscopic methods has given the value 


Aaj«*P == 1420 Me. 
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The relativistic corrections and the corrections for the finiteness of the nuclear mass 

do not raise the frequency to the required value , The proton's 

magnetic moment has also been measured very accurately. To explain this anomaly, 
therefore, it remained to assume that the magnetic moment of the electron is not exactly 
equal to the Bohr magneton, but is instead somewhat larger. Kusch and Foley showed that 
to obtain agreement with experiment the magneuc moment of the electron must be taken 
to be 


!^el — Mo (1 + 5), 


(2L49) 


where, according to recent data. 


& = 0,001 16. 

Ihese considerations show that an electron will have a very small anomalous mag- 
netic moment /jl In addition to the Dirac or kinematic magnetic moment 

( ” Mu). We shall discuss the nature of the anomalous magnetic moments in the next 
chapter. 

Concluding this section, it is necessary to point out that the hyperflne structure can- 
not explain the Lamb shift of the 2s i/^ level (1.058 Me relative to the 2/7 1 /^ level). First 
of all, it follows from (21,48) that the splitting of the ‘2si/„ term is of the order of 200 
Me, and, in addition, the center of mass of the s terms is not shifted. Suppose, the 
level with S — 1 (parallel spins of the electron and the proton) is raised by a certain 
amount [see (21,47)]; then the level with S’ — 0 (antipar^lel spins of the electron and 
the proton) is lowered by an amount three times as large [see (21.46)], Since a state 
with S = 1 is three times more probable than a state with S = 0 (when S = 1 the 
spin may be directed along the z axis, opposite to the z axis and perpendicular to the 
z axis), the center of mass of the s states remains unaltered, so that it occurs in the 
same position as when the magnetic moment of ±e nucleus is neglected. The hyper- 
flne structure cannot therefore account for the Lamb shift. The theory of this effect 
will be consid ered in the next chapter, 

t Problem 2 LiS Find the shift of the Is levels in the light p -me sic atoms as a result 
of tn?TflBT!Pffre of the nuclear structure, taking into account the nuclear moaon and 
also the variauon of the wave function within a distance comparable to the size of the 
nucleus. 

Hint, The shift of energy levels should be calculated by the perturbation method. We 
use Eqs. (21,5) and (21,7) where tlie wave functions are those of hydrogen-like atoms with 
the meson mass substituted for the electron mass. Because of the finiteness of the 
nuclear mass, the motion of the nucleus has much greater influence on the position of 
the energy levels of a meslc atom than on the levels of an ordinary atom. It is there- 
fore necessary to use the reduced meson mass in more accurate calculations. The 
wave funcuons of a meson change appreciably inside the nucleus; therefore, the quantity 
1 I' can be replaced by its value at the origin only in a comparatively crude qualita- 
tive estimate, as in the derivation of (21.14), 

Answer, In the general case 



where 


For small b and A I 






Eio — 





CO, this expression may be obtained from Eq, (21. 12)^ 
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The Electron-Positron Vacuum and the 


Electromagnetic Vacuum 



IRAC THEORY OF “HOLES.” 

THE POSITRON 


DISCOVERY OF 


The Dirac theory, which includes spin effects and relativistic effects, was able to 
account for the fine structure of the spectral lines of hydrogen-like atoms and the 
anomalous and normal Zeeman effects, T^e Dirac theory, however, also gave rise to 
a number of major difficulties in connection with the interpretation of negative energy 
states. These difficulties were not overcome for some time, but they eventually led to 
fundamental new discoveries In relativistic quantum mechanics. 

In our treatment of the motion of a free particle (Chapter 17), we mentioned that the 
Dirac equation allowed solutions corresponding to botli positive and negative values 
of energy. It is worth noting in this connection that solutions with negative energy are 
not characteristic of the Dirac theory alone, but appear in any relativistic theory. In 
relativistic mechanics, the energy of a free particle is connected with the momentum 
and rest mass by the well-known expression 


which has two roots 


£=+ YcY^ mic*. 

The regions of positive and negative energies are separated by an interval equal to 
2inQC^ (see Fig, 22,1), At first glance, states with negauve energy do not appear 
to have a real physical meaning, since the region of negative energies extends to infinity 
(IS = — oo) and, therefore, there is no lowest 
state. This would imply that no ordinary state 
is stable, since a spontaneous transition to a 
lower energy state would always be possible. 

Furthermore, a particle with negative mass 
(negative energy) would have a number of 
strange properties: for example, it would repel 
a particle with positive mass. 

In classical physics, states with negative 
energy do not cause any difficulties, because 
the energy of a moving particle can only change 
continuously; therefore, transitions from the 
states with positive energy to the states with 
negative energy cannot take place, since the 
energy would have to change dlscontinuously by 
the amount Defining the energy 

to be positive from the start, we may, therefore, 
neglect the negative energy states. 

The situation is quite different in quantum theory, where transitions can take place 
between the states in a discrete spectrum, as well as in a continuous spectrum. States 
with negative energy cannot be excluded simply by defining the energy to be positive 


£ 

- 



-/7?gC^ 


Fig. 22.1. Allowed energy levels of a 
free Dirac particle 
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2moC^ 


at the Initial time, because the probability of the transition between the states with 
energy -f and — Is not equal to zero. 

In order to avoid transiuons of electrons to negative states, Dirac suggested (1931) 
that we regard all negative energy Jevels as occupied by electrons (see Fig. 22,2), 

so that an electron with a positive energy cannot 
Jump into a negative energy level under ordinary 

- conditions.^ The state in which all negative 

- - ■ ^ energy levels and no posiuve energy levels are 

occupied is called the "electron vacuum," 

Let us now assume that a T-ray photon with 
energy t > 2//ioC^ excites an electron from the 
electron vacuum into a positive energy state. In 
this case, the absorbed 'Y-ray photon will be 
replaced by an electron with positive energy, and 
a "hole" will appear in one of the negative energy 
states (see Fig. 22.3). 

The decisive factor that led to the success of 
Dirac's hypothesis was that he interpreted the 
"hole" as a particle (the "positron") with posi- 
tive mass equal to the mass of the electron, but 
with the opposite charge, ^ Let us suppose that 
tliere are no particles at the initial time. Then 
the "zero-point energy" Eyac energy of the electron vacuum) Is equal to the sum of 
the energies of the electrons in the negative energy states 


Fig 22.2. Zero-point energy dia- 
gram of the electron-positron 
vacuum 




( 22 . 1 ) 


The "zero-point charge" is equal to 



^'o. 


n— 


Thus, from the standpoint of the hole theory, the absence of any real particles means 
that all positive energy states are empty, and all negative energy states are occupied, 
ITils case corresponds to the electron vacuum (see Fig. 22.2), 

When an electron jumps from a negative energy state n_ to a positive energy state , 
the total energy change of the system is 

Az: = + y - y e„, = e - e„ ■-= /• + 1 ;•„ | ( 22 . 2 ) 

-I Jam — dam — + - + — • 

n' n’ 


This change represents the sum of the positive energies of two nascent particles,^ Similar 
arguments with regard to charge show that the charge of the nascent particle correspond- 
ing to the "hole" is opposite to the electron charge 

e = e,-i ^eo = -e^^ + e„_=-eo + e,. (22.3) 

n' n’ 


'in accordance with the Pauli exclusion principle (see Chapter 6), only one electron 
can occupy each state 
2 

A similar conclusion was reached in the treatment of the hole theory of semiconductors 
(see the discussion of the band theory of conduction in Chapter 6) 

^The prime attached to the summation sign (^0 means that the summation extends over 
all states uL except the state uL - 
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Thus, the transiaon of an electron from a negative energy state to a positive energy 
state (as a result of the absorption of a 7-ray quantum with energy greater than 2/nof“) 
leads to the creation of a pair of particles. The unoccupied negative energy state (‘*hole”) 
may be regarded as a state occupied by a particle with positive charge + and positive 
energy.'* This particle, which was predicted by 

Dirac, was called a “positron” and was dls- : 

covered by Anderson in cosmic radiation (1932), 

Once it Is interpreted in this way, the Dirac 
theory describes in a natural manner both the 
electron and the positron. The positron is an 
antiparticle; its wave function satisfies the Dirac 
equation with positive energy and positive charge 
(see Problem 18.5), 

In the Dirac theory, pair annihilation— a 
process which is the reverse of pair creation— is 
also allowed. This process takes place when an 
electron with positive energy jumps Into a hole. 

In this case the electron and positron are con- 
verted into 7*-rays. 

In these transformations, the laws of conser- 
vation of energy and momentum are rigorously 

obeyed. As already mentioned, pair creation due to the absorption of a 7-ray photon can 
occur only in the presence of a third particle (for example, a nucleus), that takes up the 
excess momentum of the photon 



Fig. 22.3. Formation of an electron- 
positron pair 


Y -f- 2^0 — ► Zcq e^-\' (22,4) 

Similarly, the conversion of an electron-positron pair into 7-rays takes place in accord- 
ance with the laws of conservation of energy and momentum; as a result of pair annihila- 
tion, at least two 7-ray photons are created 

= (22,5) 

In order to show this, we may choose, without the loss of generality, a coordinate system 
In which the electron and positron move with opposite momenta, so that^_=:— 

(the center-of-mass system). Then, according to the law of conservation of momentum, 
the total momentum of the two photons which are formed as a result of annihilation must 
also equal zero 

n{ki-\-k>^==.0. (22.5a) 


Using the law of conservation of energy 

cn {k, 4- ki) --^2 f -f (22.5b) 

we find that — Afj, and the energy of each of the photons is equal to 

e ~ cflki — cUk^ = Y mlc^ -f" 

The lowest value of the photon energy is obtained when = 0 (that is, when the electron 
and positron are at rest). Then t min — These two 7-ray photons move apart with 
the same energy and oppositely directed momenta. It is easy to see that the electron- 
positron pair cannot be converted into a single 7 -ray photon (A- = 0), since the laws of 
conservation of energy and momentum cannot be satisfied simultaneously with only one 
7 -ray photon. 


“^Wjth the help of quantum field theory, we can construct a theory of the electron- 
positron vacuum that is symmetric with respect to charge. However, even with the theory 
described above, which is asymmetric with respect to electrons and positrons (an electron 
IS a particle, whereas a positron is a “hole”) we can give a clear explanation of many 
phenomena involving the transformation of particles 
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The law of conservation of total angular momentum (orbital plus spin) is also very 
important in the annihilation processes. If an electron moves with a nonrelativistlc 
velocity, this law (as shown in Chapter 18) can be resolved into a law of conservation of 
orbital angular momentum and a law of conservation of spin. 

The law of conservation of spin can be observed particularly clearly in the annihila- 
tion of positronium (a hydrogen atom in which the proton is replaced by a positron, or 
more exactly, a system in which an electron and a positron rotate about their common 
center of massX In this atom, the nucleus (that is, the positron) and the electron have the 

same mass, and therefore the reduced mass is The energy of levels in the positron- 
ium atom will, therefore, be one half as large as in the hydrogen atom 

E 

while the radius of the orbit will be twice as large. The velocities of the electron and 
positron may be regarded as nonrelativistlc. Just as in the hydrogen atom. If the spins 
of the electron and positron are antiparallel (parapositronlum), positronium can decay 
into two 7-ray photons (the correspon^ng mean life is 1 25 • sec). 

The total spin of parapositronlum is equal to zero, and therefore the two photons move 
apart with opposite directions of spin (that is, their total spin is zero). 

If, however, the spins of the electron and positron are parallel (orthoposltronium) the 
system must decay into three 7-ray photons (the corresponding mean life is 1.4 • 10“^ sec), 
TTie spin of orthoposltronium is equal to unity. Orthoposltronium cannot decay into one 
7-ray photon with spin 1, because in this case the law of conservation of momentum 
would be violated. It cannot decay into two 7 -ray photons because then the law of con- 
servation of spin would be violated (the total spin of two 7-ray photons is either two or 
zero). Only if orthoposltronium decays into three 7-ray photons will the law of con- 
servation of momentum and the law of conservaaon of spin be satisfied. 

The discovery of the positron opened a new stage in the study of elementary particles. 
This discovery showed that particle shad a new fundamental property— interconvertibility— 
and confirmed the existence of antiparticles. We can regard the creation of a positron 
as the conversion of a 7-ray photon into an electron-positron pair, and the annihilation 
of an electron-positron pair as the conversion of an electron-positron pair into 7-ray 
photons. 


B. THE LAMB SHIFT OF ENERGY LEVELS OF 
ATOMIC ELECTRONS 

When an electron moves in an atom, it interacts with the electromagnetic vacuum, 
as well as with the atomic nucleus and the electron-positron vacuum, Ihe interaction 
between the electron and the electromagnetic vacuum exerts a particularly strong 
influence on the mouon of the electron in the atom, and it explains the shift of the 2Si/^ 
level upwards relative to the level (in the hydrogen atom), ^ 

A complete theory of this phenomenon can be constructed only by means of quantum 
electrodynamics, which is based on the theory of second quantization. But even without 
referring to this tlieory we can sull obtain the appropriate equations with accuracy up to 
coefficients of the order of unity, while using comparatively simple physical arguments. 

One of the basic ideas of the quantum field theory is that each wave or field can be 
associated with a particle. 'Ihus, for example, the Dirac waves correspond to electrons 
and positrons, and light waves correspond to photons. It is very well known, however, 
that MaxwelTs equations describe not only light waves, but also electrostatic and magnetic 
fields, which depend on charges and their velocities (an accelerating charge produces 
electromagnetic or light wavesX The electrostatic and magnetic fields can be associated 
with **pseudo-photons,” which have observable effects only in the presence of charges. 
An electrostatic field can be expanded in a Fourier series, that is, schematically repre- 
sented as a set of oscillators with different frequencies. An analogous expansion holds 
for the vacuum field of "pseudo-photons'* 


( 22 . 6 ) 



ELECTRON-POSITRON AND ELECTROMAGNETIC VACUUMS 


351 


Since the rest mass of a "pseudo-photon** Is zero, we may write the relation between its 
frequency and the wave number k as follows: 

“ = ¥ = = + + (22.7) 

The components kp are related to the Integers Wi, Wg, and the period L (see 
Chapter 6) by the expressions 


. 2nni j r 1 . 

kjc = -j— , and so forth. 


where /ii = :t 1, ±2, ±3 Hence 

A/z 


27tAni 

" L 


‘In 




so that 


^k^^ky^k^:=::k^dkdQ, 


u-rdijH 8j:® 


If the system is spherically symmetric, this relation may be written as 


J_ 


di^ 1 
c® 271^ 


( 22 . 8 ) 


With the aid of Eq, (22.6), we find that the energy of the electrostatic field Inside a region 
of volume U is 


__I_ 

471 



U) 


(22.9) 


In deriving this expression, we have used the relation 

(cos<o<costo7)av = Y6,„,„-. 

Just as in the theory of the harmonic oscillator, the energy of the field in the lowest 
energy state is not equal to zero, but Instead is equal to the sum of the zero-point 
energies of the harmonic oscillator (see Chapter 10) 


eo — 2 





(22.10) 


The coefficient 2 in front of the summation sign takes into account the fact that each 
harmonic corresponds to two different states of polarization of the * ‘pseudo-photons.** 
In the state where there are no real photons the total energy of an electromagnetic 
field (the vacuum), must be equal to this zero-point energy (t = so). Hence, taking 
(22,8) into account, we find 


(E (»))» = i,- ft<» = -^ . (22. 10a) 

In the absence of real parades and external fields, the vacuum (including the electro- 
magnetic one containing photons and pseudo-photons) does not, as a rule, have any 
observable effects because it is isotropic. On the other hand, when real particles 



352 


RELATIVISTIC QUANTUM MECHANICS 


and external fields are present, the Isotropy is disrupted and virtual particles ("pseudo- 
photons” or electron-positron pairs) are created and subsequently annihilated (vacuum 
fluctuations). 

A simple physical pictiure of a few basic notions involved in the theory of the vacuum 
was obtained somewhat unrlgorously by Welton by means of a semlclassical, nonrelativistic 
treatment of the motion of an electron, taking Into account its Interaction with the vacuum 
fluctuations. As an Illustration of Welton's calculations, let us attempt to give a more 
concrete meaning to the zero-point fluctuations (22, lOjL In a rough approximation we 
can use the ordinary classical equation 

m„r = f£p.p. (22.11) 

to find how an electron will be affected by the vacuum field of "pseudo-photons.” Ex- 
panding £p p in a Fourier series 


^p.p. 




cos 0 )^ 


( 22 . 12 ) 


and integrating Eq, (22,11), we find the displacement of the electron positron due to the 
vacuum field: 


r 



cos Oit 


The mean-square deviation of the position is given by 


(22.13) 




(22.14) 


since 


cos 0, cos (lif cos (O'f 

and therefore f=0 Substituting (22.10a), we obtain a divergent integral 


(Ar)' 


n he \ niQC j j ^ * 


(22.15) 


from which a finite (observable) part can be separated out if the range of variation of the 
frequency is cut off from above at a frequency corresponding to the rest energy of the 
electron 


^ moc* 
“^max ^ n » 


(22.16) 


and from below at a frequency corresponding to the minimum energy of the electron in 
an atom ’’ 


1 — EiA 

If- 2nhi^- 

SubsUtutlon of (22.17) and (22.16) into (22.15) gives 




2 

n 



(22.17) 


(22.18) 


The limits of the range of variation of are specified more accurately in renormali- 
zation theory 
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Thus it can be seen that the vacuum field of "pseudo-photons" will cause the electron 
to perform a motion somewhat resembling Brownian motion with a definite value of the 
mean square displacemenL It is well known that the Brownian motion of a particle is due 
to its collisions with randomly moving molecules of the surrounding medium. In similar 
fashion, an electron undergoes "collisions" (of a special kind) with the assembly of 
virtual particles forming the vacuum. 

As was shown by N. Bogolyubov and S. Tyabllkov, ±e vacuum fluctuations cause a 
certain "smearing out" of a point electron. As a result, the electronic radius turns out 
to be the geometric mean of the classical radius and the Compton wavelength 


r 




moC* m^c 


= / 


a 


m^c' 


(22.19) 


The existence of this effective radius should have several consequences. In particular, 
the interaction between the electron and the nuclear charge should be changed, and this, 
in turn, should lead to an additional coi^ling energy and thus to a shift of the energy 
levels, llie usual expression for the potential energy of an electron In the field of a 
nucleus 

V^-e^^(r) (22.20) 


is replaced by 

V''==-f„'P(r + Ar) = -f„ II +(irv)+Y(A/-v)'+ (22.21) 

Changing to the average values and using the relations 

aT- = 0, (KFW = J 7*. (22.22) 

we obtain 

V/' = -^o{|+j(aJT*V* + ••• }«■('■)• (22.23) 

The additional energy of interaction between the electron and the nucleus is given by 


since the Coulomb potential of the hydrogen nucleus satisfies the Poisson equation 

v2.^=:_4n^^,S(r). (22.25) 

To obtain an expression for the shift of energy levels in the hydrogen atom, it is necessary 
to average the additional interaction energy over the corresponding state 


A/^p. p = J (V - 10 1 (r) I* d>x = I el<x ( AJ I ^ ( 0 ) |. m ^ . (22.26) 

ITiis shift occurs only for s states, since the quantity || (0) |“ln the approximation under 
consideration vanishes for other states (/= 1, 2, 3, ,.,), whereas for the s state 


!'K0)r = 


1 

Tzn^ay 


(22.27) 


where oq = h^/moel is the radius of the first Bohr orbit of the hydrogen atom, and n is the 
principal quantum number. If we now substitute this value into (22.26), we obtain a 
formula for the s -level shift 


= — a» — In -T, • 
P* P- 3n n* 


(22.28) 


This formula was first derived by Bethe, 
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A substitution of numerical values for the 2s state (n = 2) gives 

17,8/?= 1040MC . (22.29) 

This is in fairly good agreement with recent experimental data (A£= 1057,77 Me) for 
the Lamb shift (see Chapter 20), 

A complete study of the shift of energy levels of atomic electrons with the help of 
relativistic quantum field theory gives considerably better agreement between theoretical 
and experimental results than the semiclasslcal formula (22.28X The discrepancy is 
reduced to less than 1 Me, 

For the sake of brevity, we shall not discuss the modem theory of the vacuum in 
greater detail, and we shall confine ourselves to a mere enumeration of the main results 
that this theory has yielded. 


C. ELECTRON-POSITRON VACUUM 


Equation (22.28) for the Lamb shift was obtained from a calculation of the interaction 
between electrons and the electromagnetic vacuum. In addition to the electromagnetic 
vacuum, there exists an electron-positron vacuum and also vacuum states of other 
particles. The method of second quantization (which is in some measure applicable to 
all fields) can be used to calculate the influence of the electron- positron vacuum. 

In modern quantum field theory, the study of the properties of the vacuum states of 
different particles plays a particularly important role. TTie vacuum gives rise to inter- 
actions between particles. In particular, the electromagnetic interaction of two electrons 
(Coulomb's law) may be regarded as an interaction which takes place through the electro- 
magnetic vacuum, with one electron emitting a virtual photon and the other electron 
absorbing it. 

On the other hand, the vacuum represents a sort of “reservoir," from which real 
particles are "drawn" when they are created, and to which they "return" when they are 
annihilated. We have already come across the electrorv-positron vacuum as a "collection" 
of electrons in negauve energy states. Unfortunately, it has no classical analog and 
therefore we cannot use a semiclasslcal analysis^ as was possible In the case of the 
electromagnetic vacuum. The Coulomb field of a nucleus can polarize the electron- 
positron vacuum (so that an electron will behave as if it were in a dielectric), giving rise 
to an additional interaction 



Comparing (22,30) with (22.24), we see that the level shift due to Interaction with the 
electron-positron vacuum is about 1/40 times as large as the level shift due to the fluc- 
tuations of the electromagnetic field, and is opposite in sign. 

The electron-positron vacuum exerts a particularly strong Influence on the magnetic 
properties of the electron. It was shown by Schwinger that the magnetic moment of the 
electron becomes somewhat larger than the Bohr magnetron 

= (2431) 

The change in the magnetic moment of the electron, with the second-order term taken 
into account. 


4(i,^.p= 0.328 ^ ) Ko = - 0.001 1596 (lo (22.32) 

is in good agreement with experimental data obtained with the aid of microwave spectro- 
scopic methods (see Chapter 21 X Qualitatively, the appearance of the additional magnetic 
moment of the electron and the sign of the moment can be explained as follows. The Initial 
electron A (see Fig. 22.4), whose spin is directed upwards (we specify in this manner a 
preferred direction), creates a virtual pair— electron A' and positron j 5'— which generally 
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have opposite spin directions. Since we have a preferred direction (determined by the 
spin of electron A) the spins of electron A' and positron B' can be directed in two ways: 
either the spin of A will be antiparallel to the spin of A' and parallel to the spin of B' 
(case 1 in Flg« 22,4), or the spin of A will be parallel to the spin of A' and antiparallel 
to the spin of B' (case 2 in Fig. 22.4). 

In determining the additional magnetic mo- 
menty It is necessary to consider the following 
possibilities of pair creation and annihilation: 

(a) Palr^'B' is created and then annihilated. 

This Is possible in case 1 and case 2. No cor- 
rection to the magnetic moment should arise, 
because the probability of the creation process 
is the same in both cases, and the magnetic 
moments have opposite signs in the two cases. 

(b) Since A and B' have opposite spins in 
case 2, positron B' may be annihilated together 
with /I', as well as with the imaal electron A^ 

In case 1, however, this process is less prob- 
able, since the spins of pair A and B' are 
parallel to one another. This process gives a 
preferred state in which the additional magnetic 
moment is parallel to the initial magnetic mo- 
ment of electron A, since, as can be easily 
seen, the magnetic moment of all three particles 
(/4, A' and B') is directed downwards. Conse- 
quently, there arises an additional magnetic 
moment of the electron, equal to 


. 1,1 1 

,1 1 1 

/ 

2 

Antiparallel 

Parallel 

spins of 

spins of 

electrons 

electrons 

A and A' 

A and A‘ 


^-ZNTr. 


Fig. 22.4 Virtual creation of an elec- 
tron-positron pair by an electron The 
spins of the real and virtual particles 
are depicted by solid and dashed ar- 
rows, respectively 


where is a numerical factor which determines the probability that the initial electron 
A will be annihilated together with positron B\ 

These simple considerations give the correct sign of this anomalous magnetic mo- 
ment, More rigorous calculations led to Eq, (22,32), 

In addition to the Dirac magnetic moment, p =3 — i^oO, an electron will have an 

a 

anomalous magnetic moment |i „ =• —o'l^oPe®, which arises owing to the interaction 

an j 7 t 

between the electron and the electron-positron vacuum. 

The anomalous magnetic moment of nucleons (neutrons and protons) can be explained 
in meson theory. The accuracy of the results, however, is much less striking than In 
calculations of dte anomalous magnetic moment of the electron. This is due to the fact 
that meson theory is still in a much less satisfactory state dian quantum electrodynamics. 

According to the meson theory, protons and neutrons interact with the ir-meson field. 
Since the proton can dissociate into a neutron and a positively charged ir meson, and the 
neutron into a proton and a negatively charged tt meson, the anomalous magnetic moments 
of protons and neutrons due to the ir-meson field must be approximately equal In magni- 
tude and opposite in sign. 

The appearance of the additional magnetic moment of a neutron is explained as follows. 
The neutron has a definite probability of dissociating into a proton and a ir meson. Since 
the spin of a tt meson equals 0, it does not possess an Intrinsic magnetic moment. There- 
fore, there will be a contribution to the magnetic moment of the neutron from a pion 
which is, for example, in a p state (we recall that the orbital angular momentum in the 

s state is also equal to 0), The magnetic moment of a t meson is ^ 7 times greater 

than the nuclear magneton, and therefore the t meson produces the’*main contribution to 
the anomalous magnetic moment of the neutron. For the virtual process to fulfill the 
law of conservation of angular momentum, it is necessary that the direction of the 
orbital angular momentum (equal to 1) of the virtual meson should coincide with the 
direction of spin of the neutron, while the spin of the virtual proton (equal to */». Just 
like the spin of the neutron) should be directed opposite to the spin of the neutron so that 

— s^ = Sp=™, = Since the tt meson has a negative charge and its 

angular momentum Is parallel to the spin of the neutron, it gives rise to the negative 
magneuc moment of the neutron. 
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In order to explain the magnitude of the anomalous magnetic moment of the neutron, 
one must assume that the neutron spends V4 of Its time In its dissociated state. This 
estimate Is perfectly reasonable and gives the correct sign and order of magnitude of the 
magneuc moment of the neutron ((x^^ -”2[Xnm;). Similarly, it can be shown that the dis- 
sociation of the proton into a neutron and a tz+ meson gives rise to a posiuve anomalous 
magnetic moment of the proton -j- Moreover, the proton has a Dirac mag- 

netic moment — l^nucX More accurate data on the magnitude of the magnetic moment 

of the proton and neutron are given in the preceding chapter. 

Thus we have a basis for regarding the anomalous magnetic moment of Dirac particles 
as a secondary effect which can be explained on the basis of the field ±eory. This 
moment does not appear in the initial equations, but arises as a result of the interacaon 
either between the electric charge and the electron-positron field (in the case of electrons), 
or between the nuclear charge and the 7r-meson field (in the case of protons and neutrons). 


D, RENORMALIZATION 


One of the most important sections of modem quantum field theory is concerned with 
the question of renormalization, Ihls subject is not yet in a mathematically satisfactory 
state, but a number of Important results have been obtained. 

The basic Idea involved In renormalization is the separation of finite, observable 
terms from the divergent terms describing the interaction between an electron and the 
electromagnetic or electron-positron vacuum. 

In effect, the question of renormalization first arose in classical electrodynamics— 
for example, in tlie theory of the electromagnetic mass of the electron. If we assume 
that the electron charge is distributed Inside a sphere of radius ro, its classical electro- 
magnetic mass will be equal to 


m 


field 



(22.33) 


where 7 is a factor of the order of unity vWch depends on the charge distribution inside 
the sphere. The attempts to construct a classical electrodynamics in which a finite 
radius ro^ 10 cm would give a reasonable value for mfiela (the Lorentz theory, the 
nonlinear Bom-Infeld theory, the Boppe- Podolsky theory with higher derivauves) did not 
give any satisfactory results. Moreover, all these theories gave rise to fundamental 
difficulties in connection witli quantization. 

On the other hand, the theory of a point electron leads to an infinite value of the mass 
as To 0 both in the classical and quantum forms of the theory. It was therefore a major 
achievement of renormalization in modern quantum field theory that it was able to 
separate, from the infinite interaction energy, finite terms associated with the Lamb 
shift of the atomic levels and the additional magnetic moment of the electron. 

Modem renormalization theory extends to the problem of the self-mass and charge 
of the electron. It has been found that when the electron-positron vacuum and the electro- 
magnetic vacuum are taken into account, the electromagnetic mass (22.33) drops out 
completely. Therefore, the main mass of the electron (the so-called **bare'' mass) will 
not be associated wltli die electromagnetic field. The interaction with the vacuum (in this 
case the main contribution is obtained when the electromagnetic vacuum and the electron- 
positron vacuum are taken into account simultaneously) yields an additional mass which 
diverges only logarithmically 


In‘^. (22.34) 

bare moc- 

Renormalizauon is faced with the important quesaon of finding the value s max 
which the logarithmically divergent expression should be cut off. This problem is a 
theoretically important one, although in practice the logarithmically divergent terms may 
be regarded to be of the order of unity, and therefore the quantity Am will remain of the 
order m bare over a comparatively wide range of values emax. 
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In exactly the same way, the interaction of the electron with the electron-positron 
vacuum (rather than with the electromagnetic vacuum) leads to a decrease of Its "bare 
charge" by the amount 


A — ae In 

bare 


^max 


(22.35) 


The field corrections to the mass and charge of the electron have not yet been experi- 
mentally separated, and this important subject requires further investigation.** 


When can the emission of photons take place for the free motion of an 
elecWWMfMPHfSSliim with index of refraction n 1 (the Cherenkov effect)? 

Why does this emission become Impossible in a vacuum 1)7 
Solution. In order for the emission to be possible, it is necessary that the laws of 
conservation of energy 


Y mic* -f- c'p^ = ftco ^ mlc* f 


and momentum 


p=p'-l nk 

be satisfied, where p and p' are the initial and final moments of the electron and flk is 
the photon momentum. 

Squaring these relations and then subtracting onefiromthe other, we obtain the follow- 
ing expression for the cosine of the angle of photon emission: 

CO M r. <0* 1 

— 75p+ ;;/» 1. 

Using the relation for the index of refraction n of the medium 


e = /ico = c'hk = 


fikc 


(c' = is the phase velocity of light in the medium^, we obtain 

A i 1 * \ 

C03B=^+^(l--,). 


Radiation can take place when p/z >> 1 (that is, when the electron velocity v remains less 
than the phase velocity of light in vacuum, but becomes greater than the velocity of light 
in the medium c > v' > c'). In the classical case (h = 0), the angle at which the photon is 
emitted satisfies the relation 


cos 6 = . 

The emission of radiation in vacuum is impossible because for n = 1 we have 
cos 6 = — , and the velocity v cannot become greater than tiie velocity of light c. 


^See G Wentzel, Quantum Theory of Fields, New York* Interscience, 1949, A I. 
Akhiyezer and V B. Berestetskiy, Elements of Quantum Electrodynamics (trans ), London: 
Oldboume Press, 1962, N N Bogolyubov and D V Shirkov, Introduction to the Theory of 
Quantized Fields (trans ), New York: Interscience, 1959 
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Theory of the Helium Atom Neglecting 
Spin States 


! :r 

A, BASIC PRINCIPLES OF THE THEORY OF 
^ ^ MULTIELECTRON ATOMS 

The helium atom is the simplest multielectron atom; it con- 
sists of two electrons moving about a nucleus of charge Z = 2. In 
spite of its simplicity, this system exhibits several important 
features characteristic of the many-body problem in quantum 
mechanics. 

In classical theory two electrons can always be identified by 
subscripts 1 and 2, and the motion of each electron can be followed 
separately. According to quantum theory two electrons can, 
in practice, be distinguished from each other only when the dis- 
tance between them is large. If electrons 1 and 2 are so close 
to each other that there are points in space where their wave 
functions are both simultaneously different from zero, then, 
since electrons are identical particles, we shall be unable to 
distinguish whether an electron occurring at the point is electron 1 
or 2. 

This indistinguishability or identity of particles is a special 
feature of quantum theory. It gives rise to the so-called exchange 
forces, which have no classical analog. 

In multielectron atoms, spin properties are very important; 
these properties are neglected in the classical and Bohr theories 
and are taken into account only in quantum mechanics. In fact, 
only in the case of a one-electron atom can the spin corrections be 
neglected in the first approximation. This explains why the pre- 
dictions of the Bohr theory are applicable only to hydrogen-like 
atoms. The Bohr theory could not be extended to atoms with two 
or more electrons, since it could not account for either the ex- 
change forces or the spin states. 

In order to investigate the basic features of the quantum theory 
of the many-body problem, we shall consider in some detail the 
problem of helium- like atoms (for example, neutral helium, singly 
ionized Li^, doubly ionized and so on). 
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B.- SOLUTION OF THE PROBLEM OF THE HELIUM ATOM 
VIA METHODS OF PERTURBATION THEORY 

To start with, let us determine the physical nature of the ex- 
change forces which are related to the indistinguishability of 
electrons. We shall neglect the spin properties of the particles.^ 
We shall assume that the position of 
the first and second electrons is given 
by the position vectors ti and re- 
spectively (their origin coincides with 
the stationary nucleus; see Fig. 23.1). For 
the sake of brevity, states with quantum 
numbers (tii, /i, mO and {ru, li, tn^ will 
be denoted by, respectively, /q and (in 
this notation, therefore, n stands for the 
whole set of quantum numbers). If there 
were no interaction between the electrons, 
we could determine the motion of each 
electron separately from the Schrbdinger 
equation 



Fip: 23 1 The helium atom 


where 




H,=T,+V'„ 




(23.1) 


(23.2) 


and the subscript / takes two values; / = 1 in the case of the first 
electron, and i—2 in the case of the second electron. From these 
equations we can obtain the energy values E„^ (see Chapter 13) 




(23.3) 


The eigenfunctions will be the same as the wave functions of a 
hydrogen-like atom; they will satisfy the orthonormality condition 


5 tS; ('•) ('■) (23.4) 

If we now take into account the interaction of the two electrons 


V' = 


e 


2 

n 


el 


\ri~r,\ r ,2 ’ 


(23.5) 


^We are allowed to do this because in this approximation the problem has a solution 
in the form of a product of two functions, depending on the spatial and spin coordinates, 
respectively (see Chapter 24). 
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their motion can no longer be regarded as independent. The 
Hamiltonian of the complete system will be 

H = H, + H, 4 - 1 /' = W + V\ (23.6) 

and to describe this system we must take the Schrbdinger equation 
in the form 


r,) = 0. (23.7) 

where E is the total energy, and A) is the total wave function, 
which depends on the coordinates of both electrons. The quantity 
^i) characterizes the probability of finding the first 
electron at position ri and the second electron at position 
Therefore, the normalization condition for r^) takes the form 

^ (^1- fi) t (''i. '■») = * ' (23.8) 

where the integration extends over the coordinates of both particles. 

Since the exact solution of Eq. (23.7) entails insurmountable 
difficulties, we shall use the perturbation theory^ developed in 
Chapter 14, assuming that the mutual interaction of the electrons 
causes only a small change in their individual motions in the 
Coulomb field of the nucleus (the justification for this approxima- 
tion will be examined in greater detail later). 

We shall first consider the zeroth approximation, in which the 
perturbation energy V' can be neglected. The Schrbdinger equation 
(23,7) becomes 

r,) = 0. (23.9) 

Since the zero-order Hamiltonian H“ can be resolved into a 
sum of two Hamiltonians H, -|- Hj, each of which depends only on 
a single variable (either ri or r^), it is obvious that in the zeroth 
approximation the wave function may be written in the form ( 41 " = u) 

W = (a) ' j'na (^j)' (23.10) 


Substituting (23.10) into (23.9) and using (23.1), we get 


(£“ - H") « = {£" - (H, + H,)} 4 ... {r^) 4<„, (r,) = 
^ £"« — { 4 )„, (r^) Hi4„, (r,) + 4<„ (r,)Hi4„, (r^l = 
= £®h — (ri) £„.4i„, (r,) + 4i„, (rO £„,4(„, (rj)} — 
= {£»-(£„, + £„,))« = 0. 


The problem of the heluim-like atom is a three-body problem and cannot be solved 
exactly, even in the classical approximation. 
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Hence, the energy in the zeroth approximation 

E^ = En, + En,. (23.11) 

where Ef^ and En^ are the energies of the two electrons on the as- 
sumption of no mutual interaction. This result can be explained as 
follows. In the absence of the perturbation V\ the motion of the 
electrons is determined by their interaction with the nucleus Ze^; 
that is, their motion is completely described by the Schrodinger 
equation (23.1) which has the eigenvalues Enj [see (23.3)] and 
eigenfunctions ^nj . Since one of the electrons is in a state rji and 
the second in a state the total energy of the system is + En^ 
when V'==0. 

Since the two electrons move independently of one another, the 
total wave function will be the product of the corresponding two 
independent one-electron wave functions obtained in the one- 
electron problem. By direct substitution into Eq. (23.9) it is, 
however, easy to show that, in addition to the first solution (23.10), 
there will be a second solution (tp®=ri;) corresponding to the same 
energy eigenvalue (23.11): 

= (n)‘ (23.12) 

This solution differs from w by a permutation of the electrons. The 
first electron is now in state and the second electron is in 
state «i. 

This state of the systemhas, therefore, an additional degeneracy 
which is due entirely to the indistinguishability of the electrons; 
this IS known as exchange degeneracy. 

If both electrons are in identical states the wave 

functions u and v are identical and there will be no exchange de- 
generacy since 

II = v = (ri) (23.12a) 

In the case rii ^ , however, functions ti and v are different, and 

therefore the following linear combination should be taken as the 
general solution of the Schrodinger equation (23.9): 


f = C,w + Qy, (23.13) 

where Cj and Cj are arbitrary constant coefficients, which are 
related only by the normalization condition 

^ = 1 . 

In order to find the values of the coefficients Q and and the 
energy E of the perturbed system (the system with V' taken into 
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account), according to the perturbation theory, we shall look for 
the solutions of E and in the form: 

£ = 

= .1)0 4- tj>'. (23.14) 

\Ve use a first approximation of the Schrbdinger equation (23.7), 
and write It in the form 

(£0 _ H«) = — {E' — V') (C,u 4- Cp). (23.15) 

From Chapter 14, we recall that the solution of the homogeneous 
equation for the unperturbed problem must be orthogonal to the 
right-hand side of the inhomogeneous equation for the first-order 
approximation of the wave function [see (14.13)]. Since the functions 
u and V are solutions of the unperturbed problem, we have 


«*•(£' 

- V') (C,H + CS d% ^ 0, 

(23.16) 

V* (£' 

- V) (C^u + Cfi) d% d% = 0. 

(23.17) 


In (23.17), let us substitute ri for ry and n for r,. The function 
^ [see (23.12)] then becomes u [see (23.10)], and the function y 
becomes u , while the perturbation energy V' remains unchanged 
since In — = — Thus Eq. (23.17) takes the form 

J H ' {E' — V) {CiU + CiV) il'x, c/'A'i 0. (23. 17a) 

It is, therefore, sufficient to consider only Eq. (23.16) since the 
results can be extended to Eq. (23.17a) by substituting Ci and 

^ Cl. 

Let us substitute into Eq. (23.16) the explicit expressions 
(23.10) and (23.12) for u and t;, and introduce the notation 


%An)%A'’i) = Pii(ri), (23.18) 

'I’»,(^9)'|'»2('‘>) = Pm('4). (23.19) 

('‘i)'i'-i,(ri) = Pi,(r,), (23.20) 

t/t, (rj) (/*,) = p„ (r,). (23.21) 


Here Pii(ri) and PmW characterize the probability density distri- 
bution of electrons in the states «i and nj, respectively, whereas 
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p„ (ri) and Psi (a) characterize the so-called density of mixed (or 
exchange) states,^ when each electron is partially Instate ni and 
partially in state riu 

The orthonormality condition gives 

^ H*« (Pxi (Pxt — J p„ (ri) dhi ^ p„ (ti) d\ = 1 , 

and 

u*v ifxi dhi = j pH (n) d\ ^ pj, (rj) d\ = 0, 

and therefore we can reduce (23.16) to the form 

£'C, - {C.eS 5 d% d% + 

+ C,el J dh^ = 0. (23.22) 

The first integral in Eq. (23.22) is simply the Coulomb interaction 
of two “smeared-out” electrons 

a: = ej J dhi d%. (23. 23) 

The second integral gives the so-called exchange energy 

t 

.4 == ej J (23. 24) 

corresponding to the interaction of two electrons when each is in 
a mixed state «i and ih. Unlike the Coulomb energy K, the ex- 
change energy A has no classical analog; it is essentially a 
quantum-mechanical concept. 

Using Eqs. (23.23) and (23.24), we obtain instead of Eq. (23.22) 
the following expression: 

C^{E' — K.)-C^A=(). (23.25) 

From Eq. (23.25), we obtain the equation corresponding to (23.17a) 
by substituting -> C, and C, -► Cj : 

Cj (£' — K) — C^A = 0. (23.26) 


These densities have no classical analog 
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Equations (23.25) and (23.26) give 

1) = + (23.27) 

and 

2) E^ = K — A, Ci==: — Cj. (23.28) 

Accordingly, we have two solutions for the wave function [see 
(23.13)] and for the total energy 

1) symmetric 

= (23.29) 

= + 4, (23,30) 

and 

2) antisymmetric 

— v), (23.31) 

E^ = E^ + K — A. (23.32) 

We can determine the coefficient C, from the normalizatior 
condition of the wave functions and 

Hence, 2Cf= 1 or • Thus for and we finally have 

= + (23.29a) 

(23.31a) 

When both electrons are in the same quantum state (rii — fii), 
the functions a and v are identical. In this case Eqs. (23.16) and 
(23.17) reduce to the same equation 

I u* (F — V) ud%({% ^ 0. (23.33) 

From this it is readily seen that 


E'=^K. 


(23.34) 


4 

We recall that under a permutation of coordinates (that is, when and r 2 are inter- 
changed) the function u and u transform into one another The wave function 0** does not 
change sign as a result of this operation (symmetric function), whereas the function 0** 
reverses its sign (antisymmetric function) 
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and thus the exchange energy disappears in this case. For the wave 
function we obtain a unique symmetric solution 

= (23.35) 

with the corresponding energy of the system 

= (23.36) 

Summarizing the results of this section, we can make the fol- 
lowing statement: Application of the perturbation method to the prob- 
lem of the helium atom^ leads to two types of solutions— a sym- 
metric or antisymmetric solution. This is in complete agreement 
with the general theoryof systems of identical particles (see below). 


C. COULOMB INTERACTION BETWEEN ELECTRONS 

Let us find an expression for the Coulomb energy of two 
electrons in the lowest energy state (ni = / 2 j|=l). In this case the 
energy and the wave function of each electron are given by 


Er- 


' 2ao ' 




2\Va 

e 


(23.37) 


_ n» 

where is the radius of the first Bohr orbit. 

The Coulomb energy of interaction between the two electrons is 

K' = ^’ I-! (r,) t: (a) YFr-T^ \ (23.38) 

Here In — n j = cost), and ^ is the angle between the 

vectors n and n. To integrate (23.38), let us choose the direction 
of the -r axis along the direction of the vector n , Substituting the 
expression (23.37) for the wave functions into Eq. (23.38) and in- 
tegrating over the angles, we flnd^ 


^This IS due to the fact that the perturbation removes the degeneracy, and therefore 
the coefficients that were indeterminate in the yieroth approximation can now take specific 
values (See also Chapter 14, the Stark effect ) 

^In integrating over the angle i? (x ^ cos d). we used the relation 

^2 '‘2 • 

2 

■— for ri > r 2 



(continued) 
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2Zr2 


K = - - ^ rJ^/rje "0 J r^e "<» cfr^ 

“ *0 n 

Next, integrating over ri and /* 2 , we finally obtain 


^ ~ 8 "flo • 


Since the zero-order energy is 


£» = 2£, = — 

a„ ' 


(23,38a) 


(23.39) 


(23.40) 


the total energy of two electrons In the ground state is equal to 


+ K = + (23.41) 

Let us now find the ionization energy of the helium atom, that 
is, the energy that must be expended in order to remove one 
electron from the first orbit. For a singly ionized helium atom 
(a hydrogen-like atom) the bonding energy between the electron and 
the nucleus is simply [see (23.37)]. The ionization energy of a 
helium- like atom is therefore 

= (23.42) 

SO that for helium (Z = 2) we have 

=0.75'’“-. (23.43) 

^0 

According to experimental data, the ionization energy of helium 
is 

£‘°" = 0.9 = 24.48 ev. (23.43a) 

^0 


The discrepancy between the theoretical value and the experimental 
data is a result of the fact that the perturbation energy K= — 

4 Qq 

is not very small as compared with the zero-order energy | E^\ =~^ 
(their ratio is --^1/3). Perturbation theory, therefore, gives us 


Considering; that the integrand 4 ’i(X\) 2) symmetric with respect to the variables 

fl and £3, we replace (for rj £3) the radius £1 by £3 and vice versa, obtaining 

rj '■l '■ ''2 • 

0 for 1 j £3 

This expression was used in evaluating the integral (23 38 ) 
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only qualitative aspects of the problem. The accuracy of the method 
is not very great because /C and £« have the same order of magnitude. 

D. THE VARIATIONAL METHOD 

The variational method developed by Ritz, Hylleraas and others 
was first successfully used to find the ground -state energy of 
atomic systems, and in recent years ithasbeen applied in collision 
theory. 

As we know, the average energy of anatomic system is given by 

(23.44) 

If the wave function is represented as 

= (23.44a) 

where the coefficients C„ give the probability of an electron being 
in state n , the average value of the energy will be given by the 
relation [see (7.21)] 

^ = (23.45) 

n 

Replacing each eigenvalue £„ in the summation by the lowest 
eigenvalue and using the fact that for normalized functions 

n 

we find that 

Emin ^ ^ (23.46) 

that is, the lowest value of the integral ^ can be used to 

determine the upper limit of the ground-state energy of the system. 

It was found that the variational method gives very good results 
when the perturbation energy E' is of the same order as the energy 
Eo of the zero-order approximation. The variational method can 
therefore be used in cases where perturbation theory gives poor 
results (for example, in calculation of the ground- state energy of 
the helium atom). When a problem is solved by the variational 
method, both the additional interaction V' and the main interaction 
are treated equally in the Hamiltonian II of Eq. (23.7). A test 
function then depends on several parameters and is selected 
in such a way that the integral can be calculated exactly. The 
energy £ will then be a function of these parameters and it is 
obvious that the minimum value of this function will be close to 
the true value if the test function resembles the true function. 
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The most difficult part of the method lies in the choice of the 
best test function. All the available information on the properties 
of the system must be used in making this choice. It is impossible 
in the general case to indicate the form of the test function, and 
therefore it is frequently necessary to rely on physical and 
mathematical intuition. Very often, a test function is similar in 
form to the solution of the unperturbed equation. 

We shall now use the variational method to calculate the ground- 
state energy of the helium atom. Our procedure will be based on 
that of Hylleraas (1927). At the end of the discussion we shall 
compare both perturbation and variational methods. 

For the test function, Hylleraas chose the ground- state function 
(23,27) of the hydrogen atom, replacing the charge Z by a certain 
effective charge Z'. The quantity Z is the unknown parameter which 
has to be determined from the variational principle. The test 
function 


will obviously be normalized to unity, just like (23,37), since its 
normalization is independent of the value of Z'. 

The Hamiltonian H in (23.46) must include both the zero-order 
approximation Hamiltonian and the perturbation potential energy 
V' . We thus obtain 


H = Tj + Vi + Ta + V, + V\ (23.48) 

where and Vy(y=l, 2) are given by (23,2) and the perturbation 
potential energy V'is given by (23.5). 

Since the wave functions are normalized and since + does 
not depend on ri and is independent of the coordinate 

the average value of the Hamiltonian is 


where 



(23.49) 


(23.50) 


(23.50a) 

V' = J -Pf (r.) (r,) d%d%. 

/ (23.50b) 
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The integral (23.50b) agrees exactly with integral (23.38); therefore, 
setting Z = Z', we obtain in accordance with (23.39) 

(23.51) 

O ^0 

The quantity f, in Eq. (23.49) represents the average value of 
the kinetic energy of a hydrogen-like atom with atomic number Z' 
when the electron is in the lowest state. This value Is related to 
the total energy of the hydrogen-llke atom by the well-known ex- 
pression 




(23.52) 


In exactly the same way, replacing Z'in (23.50a) by Z, we can ob- 
tain the average value of the potential energy of a hydrogen-like 
atom, since U is well known that the potential energy is twice the 
total energy (l^ “2E1 ). Consequently, 



zrei 

* 




(23.53) 


It follows from (23.49) that the average value of the energy is 
given by the expression 

£(2')==^(2'*— 2ZZ' + |-Z'). (23.54) 


which is a function of the parameter Z', 

Let us now find the value of the parameter Z' corresponding to 
the minimum energy of the system. Differentiating E (Z') with re- 
spect to Z' and setting the derivative equal to zero, we find 



The minimum energy of electrons in the helium atom is therefore 

(23.55) 

For the Ionization energy we have 



In particular, for helium (Z = 2) we obtain 


(23.56) 
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This result is considerably closer to the experimental value [see 
(23.43a)] than (23.43) obtained from the perturbation theory. 
Hylleraas later improved the agreement with the experiment by 
using several variational parameters. The result (23.55) for 
has a simplephysicalinterpretation, namely: the interaction between 
the electrons results in a screening of the positive nuclear charge. 

The variational method can also be used to find the upper energy 
limit of one or several excited states. In this case the test function 
has to be chosen in such a way that it is orthogonal to the wave 
functions of all the lower states. When the energy levels are 
arranged in order of increasing magnitudes (Eq, Ei, E 2 . . .), it can 
easily be shown that since (p is orthogonal to functions ^ 2 . • .f 

the corresponding expansion coefficients Q, Ci, C^,. . .are all equal 
to zero. Therefore, in accordance with (23.45), the energy is 


00 

£=2 (23.57) 

rt=rtO 


where tiQ is the quantum number of the given excited state. Using 

the relation we find that the minimum value of this 

energy corresponds to the unknown energy of the excited state. 
The application of this method to the calculation of the energies of 
the highest excited states is rather difficult, because of the neces- 
sity of introducing a large number of additional conditions to ensure 
that the wave function of a given state will be orthogonal to the 
wave functions of the lower states. 


E. DERIVATION OF THE SCHRODINGER EQUATION 
BY THE VARIATIONAL METHOD 


We shall consider one of the most general forms of the variational problem, when 
the choice of the test wave function 4* that is used to find the average value of the 
Hamiltonian 


E = 



(23.58) 


describing the motion of a single particle is restricted only by the normalization con- 
dition 


J (23.59) 

Upon varying B with respect to and using the hermlticity of the operator H, we ob- 
tain 




(23.60) 
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Here the variations 84' and 84 * cannot be regarded as independent since they are con- 
nected by the normalization condition (23,59). These variations can be made Independent 
by varying the condition (23,59) 

^ S-^* <1- rf’AT + ^ S4”)/* d>x = 0. 

Multiplying this equation by a constant Lagrangian multiplier I , which is chosen in such 
a way that the variations are now independent^ we add the resulting equation to (23,60), 
Since the variations 84 and 84 * are now arbitrary, the variational principle gives auto- 
matically the Schrodinger equation for 4 and 4 ** 

(H-~£)4 = 0, (H*^ — £)4* = 0, (23.61) 

and the physical meaning of the parameter A becomes clear: it is the negative of the 
energy E (l^^ — E). 

Consequently, the variational principle and the normalization condition together lead 
to the Schrodinger equation. It is apparent from the above results that the eigenvalues 
of the Schrodinger equation (23.61) give the extremals of the variational integral, A 
more detailed analysis shows that these extremals are minima and that the energy of 
the ground state corresponds to the absolute minimum— the smallest possible energy 
value. In order to calculate excited states, it is necessary to impose orthogonality 
conditions on the wave functions of the lower energy state (as mentioned above), whereas 
orthogonality follows automatically in the Schrodinger theory. 

F. HARTREE-FOCK METHOD OF SELF-CONSISTENT 

FIELDS 

We have considered two extreme cases of the variational method. 
In one case (the Ritz-Hylleraas method) the variation of the wave 
function amounted to a determination of the ‘‘best values^' of 
parameters in a specially chosen wave function. In the other case, 
th e choice of the w ave functions was restricted only, by^theji^mal- 
izatipn^condRiQm^lTns caSe^'leiTta" the Schrodinger equation. "There 
is also an intermediate case in which the wave function is not 
specified, but is assumed to be a product of one-electron functions 
depending on the coordinates only. The specific form of these 
functions is found by using the method of successive approxi- 
mations to solve an equation derived on the basis of the variational 
principle. 

One such method was proposed by Hartree (1928). This method, 
whose interpretation from the point of view of the variational 
principle was given later by Fock, may be described as follows. 
Let us start with the variational principle for two particles in 
general form^ 


£ = J 4 * (ri, ra) H 4 (ri, r^) cPxi cPx^. (23.62) 

As an additional condition, we require that the total wave function 
should be a product of one-electron functions 

(23.63) 


^This integral can be generalized in a similar fashion to the case of three or more 
particles 
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It is also necessary to allow for the normalization condition 

j I** 't'j'J'iti d'^Xid'Xi = 1 , (23.64) 

which can be written for each particle separately 



Substituting the test function (23.63) into (23.62) and varying 
and < 1 > 2 , we obtain 


5 mtr. + mt) (^i + h, + ^) + 

“1“ h -f" 1^2 — 0, 


(23.65) 


where = + is the Hamiltonian describing the 

motion of a single electron (/== 1, 2), and represents the in- 
teraction energy between the electrons. 

From the normalization condition (23.64) we obtain a relation 
between the variations 

I 'I'fS'J'J't’i'l’a 4- tw 4- d%(PXi = 0. 

Multiplying this equation by the Lagrange multiplier X = — E and 
adding it to (23.65), we can select X so that all the variations 
^1^2 1 and so on, will be Independent. Therefore, we obtain the 
Hartree equations 

(Hi + ( \ 1^1 = 0 , 

/ V V e" 

Analogous equations can also be obtained for the complex conjugate 
functions. We multiply the first equation by and integrate it 
over the whole coordinate space of the second particle; in a 
similar fashion, we multiply the second Hartree equation by and 
integrate it over the coordinate space of the first particle. Adding 
the two resulting equations and dividing the total sum by 2 we 
obtain the expression for the energy 

^ ^ ^ J + -^ 2! Z S (23.66a) 

i 
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where in the case of two particles /. /=1, 2. This equation can 
also be used successfully for a larger number of particles. 

If we neglect the interaction energy (that is, if we set the terms 

containing equal to zero) and use the relations 


E = Ei-\-Ei 

and 

Ej= 5 

Hartree’s equation can be resolved into a system of two inde- 
pendent equations 


( H / — £/)'['/ =0 


describing the motion of each particle individually. 

In problems treated by the Hartree method, the field in which 
each electron moves is composed of an external field (for example, 
the nuclear field) and the field produced by all the other electrons. 
This method is, therefore, known as the self-consistent field 
method. 

Fock (1930) generalized the Hartree method by taking into 
account the exchange effects. For this purpose it is necessary 
that the test function in the initial equation (23,62) be consistent 
with the Pauli exclusion principle. The choice of test functions 
is, therefore, further restricted by the requirement of anti- 
symmetry (antisymmetric functions satisfying the exclusion prin- 
ciple will be considered in greater detail in the next chapter). 

Hartree’ s system of equations (for example, for an electron shell 
in an atom) is solved by the method of successive approximations. 
First of all, the wave functions in the zeroth approximation (that 
is, neglecting mutual interaction between the electrons) are deter- 
mined. These wave functions together with the interaction potential 
between the electrons are then used to obtain the first approxi- 
mation equations. The first approximation wave function is sub- 
stituted back into the Hartree- Fock equations to obtain the next 
approximation and so on. This process is continued until the 
solutions obtained in successive approximations are identical 
(within the desired accuracy), that is, until the solutions are self- 
consistent. Hartree’ s system of equations can be solved only by 
means of numerical methods of integration. With the help of 
modern computers it is now possible to use Hartree’s method to 
determine the energies and the wave functions for both light and 
heavy atoms. 

Another approximation which can be used in treating heavy 
atoms is the Thomas-Fermi statistical method. Although this 
method is not as accurate as the self-consistent field method. 
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it is comparatively simple and predicts many properties of multi- 
electron atoms. This method will be used in our subsequent in- 
vestigations, and we shall consider it in Chapters 25 and 26 in con- 
nection with the theory of the periodic system of elements. 


G. INVESTIGATION OF THE EXCHANGE ENERGY 


Let us consider in somewhat greater detail the physical mean- 
ing of the exchange energy (23.24), which represents the average 
value of the Coulomb interaction between two electrons that are 
both partly in state aii and partly in state tii. In accordance with 
Eqs. (23.30) and (23,32), the total energy of the system is re- 
lated to the Coulomb energy K and the exchange energy A by the 
expression 

E = E^ + K-±A, (23.67) 


where the plus sign corresponds to and the minus sign to 
In order to analyze the exchange energy in greater detail, we shall 
examine the behavior of the system in time, taking into account the 
exchange energy. The wave functions of the symmetric and anti- 
symmetric states may be written as 

(^) = ^ ^ ^and = (23.68) 


Introducing the notation 


E^-\-K 


rO), 


A 

n 


Eq. (23.68) may be represented as 




(23.69) 


(23.70) 


Let us consider a state of the system described by a super- 
position of the solutions and 

ra, = + (23.71) 

It is easy to show that the function 'F (t) represents the general 
solution of the Schrodinger equation (23.7) in the first-order 
perturbation theory. 

We shall now assume that at the initial instant of time (/ = 0) 
one of the electrons is in state /Zi, and the other is instate 
Then the function 


¥ ( 0 ) = - ((c c=*) « + (C' - n 


(23.72) 
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is simply equal to the function u. It follows that 
-^(C“+ (?)=.!. and C®-C=*=0. 

that is. 


C'=C=‘=— 

y-i 


(23.73) 


According to Eqs. (23.70), (23.71) and (23.73), at a certain later 
time t , the function *F (i) becomes 

W (0 = e- {« cos 8/ — iv sin 8i} =e-‘"‘ {C„« + C^v}, (23.74) 


where 


C„ — cosSi, C„ = — tsinS<. (23.75) 

It is obvious that the amplitudes and which satisfy the 
normalization condition 


( 23 . 76 ) 

characterize the probability of a system being in states described 
by u and v, respectively. 

Since at / = 0 we have = 0 and C,^=l, the system is initially 
in a state represented by the function u. However, after a time 


(23.77) 


(so that 8/ = y), it can be seen from (23.75) that the coefficients 
Cy and Cj,, will have the values 

C^ = 0and = 

At the time / = T,the state of the system will therefore no longer 
be described by the function u, but instead by the function v. This 
indicates that whereas at time / = 0 one of the electrons of the 
system is in state and the other in state after a time A^=='c, 
the first electron will be in state fh and the second electron in 
state /Zj. The time t (23.77) during which the “exchange*’ of 
electronic states occurs is called the “exchange time.*’ It is con- 
nected with the exchange energy A by the simple relationship 


7c 

26 • 




(23.78) 
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It is readily seen that if there is no exchange energy (>1=0) then 

T = OO. 

To conclude this section, let us evaluate the exchange energy 
when the first electron is in the state /ii=l5, and the second 
electron is in the state ni = 2s. The wave function is given 

by (23.37) and the wave function — '^2 ^ found from (13.26) 

and (13,28)® 




(23.79) 


For the exchange energy, instead of (23.38), we obtain 


= J 'l>,(rs) l-i (o) ^ (^i) d%dW (23.80) 


Carrying out the integration in the same way as was done for the 
evaluation of energy, we obtain, instead of (23.38a), 


CO 




- (3/2) 

e 


I^L 

"0 dri 



■ (3/2) 


ao 


dri. 


Evaluating these integrals (which is a relatively simple process) 
we find that 


A 



(23.81) 


In accordance with (23.78), the exchange time of electrons in the 
Is and 2s states in the helium atom (Z = 2) has the value 


T 


S^Kfiao 


0,8.10-15 


sec. 


The exchange time in the case where one of the electrons is in 
the Is state and the other in the 10s state is of the order of several 
years, and there is practically no exchange. Consequently, it can 
be seen that the exchange energy plays a significant part only 
when the probability densities of different states mutually 

overlap to a considerable extent. When the overlapping of the 
wave functions is insignificant, the exchange energy ^4 is very 
small. This situation somewhat resembles the transfer of energy 
from one coupled pendulum to another. It is well known that an 
oscillating pendulum transfers its energy to another pendulum, 


See also Problem 13.1. 
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initially at rest, and its amplitude of oscillations after a certain 
time interval becomes equal to zero. The exchange time of the 
oscillatory energy depends to a great extent on the relationship 
between the natural frequencies of oscillation of the two pen- 
dulums; the time of exchange attains a maximum when the two 
frequencies coincide (the case of resonance). It must be em- 
phasized that this analogy is purely formal and is possible only 
because of the wave properties manifested in both phenomena. 



("hapter 24 


Elementary Theory of Multieiectron Atoms 
including Spin States 

A. SYMMETRIC AND ANTISYMMETRIC STATES 

As we pointed out at the beginning of Chapter 23, the quantum- 
mechanical theory of assemblies of identical particles contains 
several features that have no classical analog. The most im- 
portant of these features arise from the fact that the state of a 
system is unaffected by interchange of identical particles. This is 
known as the principle of indistinguishability. 

We shall consider here the general properties of a wave func- 
tion that describes a system composed of two identical particles. 
The state of the system is characterized by the position vector r, 
three spatial quantum numbers n , / and m , and the spin quantum 
number s. For the sake of brevity, we let n denote all three of the 
quantum numbers n, I and The wave function of the system 
has the form 


T (/z,5i/-i; ihis.yr>), (24.1) 

where subscripts 1 and 2 refer to the first and second particles, 
respectively. 

Let us introduce an operator P which permutes the position 
coordinates Tj and or the quantum numbers Si and 

PT(/zi5iri; /iiS,r2) = T(/ZiSir2; /liSirO; (24.2) 

It is easy to find the eigenvalues of this operator 

PT(//i6iri, n.iSiri) = W thSir^). (24.3) 


It follows from (24.2) that two successive applications of P lead 
to the initial wave function 


P^T (riiSiru n.iS2r^)^P^ (riiSir^, = T (24.4) 
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On the other hand, it follows from (24.3) that 

(fiiSiru n^Sir^) = 'k'^W {fiiSiru n^s^r^). (24.5) 

The eigenvalues of the permutation operator are therefore 

X = ±l. (24.6) 

This result means that interchanging the particles leaves the wave 
function unchanged (symmetric wave functions, with X= 1 ) 

('hSiA; /^‘25iFa) = ^^ nis^r^), (24.7) 

or causes the wave function to reverse its sign (antisymmetric 
wave functions with X = — n 


/iaSi/-5) = — »r-* (rtjSir,; n.Ar,). (24.8) 

According to a principle of quantum mechanics, a set of 
identical particles can only exist in states with a definite type of 
symmetry, A state is either symmetric (described by a symmetric 
wave function) or antisymmetric (described by an antisymmetric 
wave function). Quantum transitions between symmetric and anti- 
symmetric states are impossible; states with different types of 
symmetry do not mix. This suggests that there are two kinds of 
particles: one kind described by symmetric wave functions, 
and the other by antisymmetric wave functions. 

B, FERMI-DIRAC AND EINSTEIN-BOSE STATISTICS, 

THE PAULI EXCLUSION PRINCIPLE 

Experimental and theoretical investigations of the properties 
of systems of identical particles have shown that there are two 
kinds of particles with fundamentally different statistical prop- 
erties, and that this difference is essentially related to the spin. 

( 1 3 

s = ••• units of the 

Planck constant obey Fermi-Dirac statistics. These particles 
are known as fermions and include electrons, protons, neutrons, 
/i mesons and hyperons (all with spin Va)* On the other hand, par- 
ticles with integral spin (s = 0, 1, ...) obey Einstein-Bose statistics. 
These particles are known as bosons and include w mesons, K 
mesons (both with spin 0), and photons (spin 1). 

We may note here, without going into a detailed analysis of the 
statistical properties of aggregates of particles, that in Einstein- 
Bose statistics an unliminted number of particles can occupy each 
state. On the contrary, in Fermi-Dirac statistics only a single 
particle can occupy each state defined by four quantum numbers. 
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This characteristic feature of fermions was established empirically 
by Pauli (1923) before the discovery of quantum mechanics and 
quantum statistics; it is known as the Pauli exclusion principle. 

In order to establish the relationship between the type of sym- 
metry of a state and the kind of statistics that is appropriate for 
this state, let us consider a system of two particles which have a 
negligible mutual interaction. The wave function for the system 
can be written in the form of the product 


W (/q.sjr, ; n^s^r^) = (n) ( 24 . 9 ) 

where and are the wave functions of the individual parti- 
cles. The solution (24.9) corresponds to a state in which the 
particles with positions ri and are in states characterized by the 
quantum numbers fiiSi and respectively. Because of the 

Identity of the particles, the state of the system is not changed 
when the particles are Interchanged. The following solution is 
therefore just as valid as (24.9): 


T (n^s^ru riiSit^) = (r,) (r,) (24.10) 

(that is, particles with coordinates and are in the states n^iSi 

and thSi , respectively). 

Both wave functions (24.9) and (24.10) describe the same 
physical state of the system. We note that any linear combination 
of these functions will also describe the same physical state. One 
of the possible linear combinations is the sum of the solutions 
(24.9) and (24.10), namely, 

\J/*s I i) (r 2 ) (/'i) 1 ) } (24.11) 

(we have assumed that the functions and are mutually 
orthogonal and normalized to unity, and therefore we have intro- 
duced the normalization factor 'jTf). This solution is symmetric 
with respect to interchange of the spatial and spin coordinates 

Ts ^iiSir^). (24.12) 

Another possible linear combination is the difference between 
(24.9) and (24.10) 

^ (n) (As) — '('n.j, (r,) (r,) }, (24. 13) 

which gives an antisymmetric solution since 


'P=*(rtis,r,; /iiSjr,) =— (niS^r,; niSir,). 


(24.14) 
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From the form of and it follows directly that particles 
obeying Fermi-Dirac statistics can be described only by anti- 
symmetric solutions. The Pauli exclusion principle (Fermi sta- 
tistics) is satisfied only by a solution of the form (24.13) since 
when both particles are in the same state (that is, ni = n.i^n and 
Si == Si = s) , we have 

'Fa (nsri; /isr^) = 0, but 'F^ (nsri; nsr<i)^0. (24.15) 

On the other hand, 'Fa does not satisfy the requirement of Einsteln- 
Bose statistics, in which an arbitrary number of particles can 
occupy the same state. 

It is easy to generalize the antisymmetric and symmetric wave 
functions to the case of a system composed of N noninteracting 
particles. The antisymmetric solution may be represented by the 
determinant 


lira — _ * 

Vn\ 


(^l) ••• 

(/* i) 4*/i252 (^ 2 ) • • • /v) 


(rN) 


(24.16) 


The antisymmetric nature of this solution is obvious since inter- 
changing any two columns changes the sign of the determinant. 
Moreover, it is obvious that the function (24.16) satisfies the Pauli 
principle because if any of the sets of quantum numbers are equal 
(for example, ni — n.^ 81 = 82 ) the determinant vanishes. 

The symmetric wave function can be written in the form of a 
sum of products 


+ 

+ ••• }• (24.17) 


C. ADDITION OF ANGULAR MOMENTA. RUSSELL-SAUNDERS 
COUPLING. CLEBSCH-GORDAN COEFFICIENTS 

Let us consider more fully the antisymmetric wave function 
for a system composed of two particles 

(n) W. (a) — (/•>) (O) }• (24. 18) 

In the many-body problem, we are faced with the question of how 
to add the angular momenta. In the two-particle case, each 
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particle is characterized by orbital angular momentum quantum 
numbers /j and , respectively, for which 


= (/,+ !) (24.19) 

and spins Si and ^ 2 , for which 

Sf = ^%(si+1). (24.20) 

There are two different ways in which these four quantities can be 
added. One of these involves adding the orbital and spin angular 
momenta separately and then finding their total sum 

L^Li-fLg, S = S, + S2, J=L + S (24.21) 

This t 3 q)e of coupling of the particles in an atom is called LS 
coupling or Russell-Saunders coupling. It corresponds to the case 
in which the total orbital angular momentum and the total spin 
angular momentum are conserved separately. This situation is 
encountered most often in light elements. 

The second way of combining the angular momenta involves the 
addition of the orbital and spin angular momenta of each electron 
individually and then finding the total angular momentum of all the 
particles 


= + Si, J,- L, + S^; J = J, -h J 2. (24-22) 

This type of coupling is called// coupling and is encountered mainly 
in heavy elements. It corresponds to the case in which only the 
total angular momentum is conserved, and therefore it is im- 
portant when there is a strong spin-orbit interaction. 

We shall now compare the order of magnitude of the Coulomb 
and spin-orbit interactions in the helium atom. The Coulomb 
energy [see (23.23)] has the form 

/c = es ^ ^ (24. 23) 

(T lies approximately between 0.1 and 1). The existence of LS 
coupling depends on the Coulomb energy since the orbital and spin 
angular momenta are conserved separately in this interaction. The 
spin-orbit interaction for the helium atom 

= -^It L- s -V (24.24) 

(a = Via?) is considerably smaller than the Coulomb interaction, and 
therefore the coupling is of the LS or Russell-Saunders type. It can 
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be seen from (24.24) that the order of magnitude of the spin-orbit 
interaction depends strongly on the nuclear charge z (~Z‘), so 
that for large Z (heavy elements) the value of "• may be quite 
large. In this case the coupling will be of the jj type. 

Let us consider the problem of the addition of the orbital angular momenta of two 
particles and find the function ( 1 , 2 ) which is an eigenfunction of the operator for the 
z component of the total angular momentum L = Li -j- L 3 

L. Vlm (>.2) = (Lu + L,,) (1.2) = (1.2), (24.25) 

and of the operator for the square of the total angular momentum 


(^2) = {Lf 4- L| + (Li.v — ^^iy) (Lajtf + i^iy) + (^Ije + ^Li^) — i^iy) + 

+ 2L,,L3,} (^2) - nnY^j^ (1,2); \ = L(L-\-\) ^24,26) 

As we shall show, the eigenfunction K£;^(l,2) can be written as a linear combination of 
products 


>'zA|('.2)= 2 


(24.27) 


where the wave functions (0 and ( 2 ) refer to the first and second particles, 
respectively, and 


Li. 0)^1lrn,Yfl^ (1); L 2 , Yjl^^ (2)^1im,Y^^^ (2); 

Lf Yfl^ (1) = fi% (/, -f 1) Yfl^ (1); LI (2) = (/, + i) (2) (24.28) 

The numerical coefficients linear superposition (24,27) arc called the vector 

summation coefficients of Clebsch-Gordan coefficients. 

For simplicity, we shall restrict ourselves to the case /, = /o = 1, Substituting (24,27) 
into Eq, (24.25) 

L*>'^a.(1.2)= 2 + (24.29) 

we find that M = nh + Waand 

Without loss of generality we may set M = 0;that is, we make the z axis perpendic- 
ular to the total angular momentum. Then it follows from (24,27) that 

yLoO.2) == b-tyi (1) Lr* (2) + (l) Y{ (2) + (I) TJ (2). (24.30) 

Taking into account the equation for spherical harmonics [see (11,88)] 

(Ux ± lUy) (1) = - » y (/1 + 1 i m.) (/i + m.) ^(^-^'(l), (24.31) 

and using Eq. (24.26), we obtain 

^ fl^ [2YI Kr^ + 2K; KJ] + b, [2Kr* Y\ + 2K?k;i + 

+ bo [4T? K? + 2K1 + 2Yi^Y\]} == X {b_, Y{Y,^ + b, Y[ + K?), (24.32) 

This equation is equivalent to a system of three homogeneous equations for the unknown 
Clebsch-Gordan coefficients 


2^o + (2-X)^i =0 

2bo +(2— X)^,i = 0 

(4 — X) bo 2bi 2b^i = 0. 


(24.33) 
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This system has a nonvanishing solution if its determinant is equal to zero 

2 2 — X 0 

2 0 2 — X =0. (24.34) 

4 — X 2 2 

Calculating this determinant and subjecting the wave function to the normalization 
condiuon 

that is, 

bl-\-b\ + bU = \, 

we obtain three possible solutions: 

1) X = 0, 6. = 

From the conditions X = L (L + 0 we find that ^ — 0; that is, this solution corresponds to 
the case in which the vectors h and are antiparallel 

it t i /a. 

2, 2-2, ».=0, t_, 

This case corresponds to the addition of the angular momentum vectors at an angle 
of 60® 



This solution corresponds to parallel h and U 

A I f A. 

We conclude, therefore, that two vectors h and/*, whose absolute values are Integral 
quantum numbers, can be oriented at angles such that the vector representing their sum 
L is also characterized by Integers. For /j ^ /* these values are 

Z.==/i-f”/2, /i + /* — 1, ...» /i — /a+lf ^1 — la (24,35) 

(a total of 2/a + 1 values). 

This method of adding the angular momenta is based on a vector model of the atom. 
We can obtain our previous results by setting /i=/a=l andM = 0. It is Interesting to 
note that the same values of the Clebsch-Gordan coefficients would also be obtained in 
adding the orbital angular momentum to a spin angular momentum equal to unity (for 
example, for photons). In this case, however, one would have to use the normalized spin 
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function instead of the spherical harmonics It is easy to show that the Qebsch- 

Gordan coefficients will satisfy the following conditions: 


m$ m% LL /n* 


In general the aebsch-Gordan coefficients will depend not only on the quantum 
number m 3 = M — mi, but also on L and M. If the orbital angular momentum of the 
second parade is /a = I, then adding both orbital angular moments , we obtain 

I 

ffia 1 


for which Che Clebsch-Gordan coefficients are given in Table 24.1. 


D. WAVE FUNCTION OF THE HELIUM ATOM 
INCLUDING SPIN 

We shall now consider more fully the wave function of the 
helium atom, for which, as we know, the interaction between the 
spins and the orbital angular momenta of the electrons is of the 
Russell-Saunders type. Since, therefore, the orbital and spin 
angular momenta are added independently of one another, the wave 
function can be written as a product of two parts, one of which 
depends on the spins of the particles and the other on their co* 
ordinates. The wave function must be an antisymmetric function 
with four quantum numbers 

T^=:C(Si, Sa) ^J^rjina (^1» = C (S 2 , Sj) (/*!, Tg) = 

= C (Sj, S2)^rtjna(^2» ^l)- (24.36) 

There are two possible cases: either the function is symmetric 
with respect to the spins and antisymmetric with respect to the 
coordinates, or the reverse case is true. We have therefore the 
following two types of solution: 

¥» =a (s„ s,) (/•„ r,). (24.37) 

r,). (24.38) 

We recall that the spatial part of the wave function has already 
been obtained (see Chapter 23). For rii ^ n^, 

(n, r,) = ~ (« + v), (24.39) 

KnAn, r,) = ^(M-v), 


(24.40) 



Table of Clebsch-Gordan coefficients (for l» = i) 
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(/i -j- M) di -I- Af -j - 1) ■%/~ — M+I) l/" </i — M)(li — jM+I) 

2(/, + 1H2^i + 1) V + V 2;.(2/i + l) 
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where 


U = 'l'«3 

(24.41) 


Let us now investigate the spin part of the wave function* It has 
already been mentioned that in Russell-Saunders coupling the spin 
angular momenta are added independently of the orbital angular 
momenta. The law of conservation of spin, therefore, provides a 
basis for the construction of the spin wave function for the two- 
electron system. We shall take the spin function of each electron 
to be an eigenfunction of the operator for the z component of the 
spin 



(24.42) 


and also of the operator for the square of the spin 

== (<^1 • (24.43) 


Here we write 2x2 Pauli matrices <s (see Chapter 16) without 
primes 


ai = 





Thus the spin function C = of a single particle satisifies two 
equations 


S,C _ OjC — ~ 

Since — 1 , and so on, we obtain from Eq. (24. *V/ r^.,2' /4* JL 

matrix equation (24.44) for X, is equivalent to a system of two 
homogeneous algebraic equations 

c,(V,-X,) = 0, 

Cj(V3 + M = 0. (24.46) 

from which it follows that there are two solutions corresponding to 
the two possible orientations of the spin with respect to the z axis 


(24.44) 

(24.45) 


1) X] — ’/s. Cj — 1, Cj — 0. 


(24.47) 
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Here the spin is parallel to the z axis. The wave function corre- 
sponding to the eigenvalues V 2 is 


2 ) 



Xj — — Cl — 0, 


^2 = 1 . 


(24.48) 

(24.49) 


In this case the spin is antiparallel to the z axis. The correspond- 
ing wave function is 


C 



(24.50) 


In Eqs. (24.48) and (24.50) the value of the z component of the 
spin is indicated in the parentheses following the amplitude C. 
It is easily shown that both spin parts of the wave function satisfy 
the orthonormality condition. If, in accordance with the definition, 
we take the Hermitian conjugate spin function to be a single-row 
matrix 


then from (24.48) and (24.50) we see that 


and 




1 , 




= 0. 


The effect of Pauli matrices on the spin function, according to Eqs. 
(24.50), (24.48) and (24.44), is 

„c(±})=c(:pi)-. „c(± = 

.,c(±i) = ±c(i-i). (24.51) 

We shall look for the spin function C of the two electrons of the 
helium atom in the form of a superposition 

C=o,C, (i) c. (i) + «.c, (I) C, (- J-) + 

(— 7) ( 7) +Oi<^i(— -2') (“ ■}) • 


(24.52) 
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where C, and are the spin functions of the first and 

second eiectrons, respectively, while ai, a,, a,^ and Cj are the 
Clebsch-Gordan coefficients that have to be determined. Let us now 
attempt to select the coefficients of the function (24.52) in such 
a way that the latter will be an eigenfunction of the operator for the 
z component of the total spin 

s,=s;+s:=|(a;+a'3), (24.53) 


and also of the operator for the square of the total spin 




= (S' + Sy a"' + 2 <J'. ff"} == 


(24.54) 


Here the primes and the double primes on the Pauli matrices 
indicate that these matrices act on the spin functions of the 
first and second electrons, respectively. We thus have two 
matrix equations 


|{a' + a')C = /iX,C, 


(24.55) 




(24.56) 


Substituting the total spin function (24.52) and taking into account 
the effect of the Pauli matrices on the spin functions of the in- 
dividual electrons [see (24.51)], we find from (24.55) 

(.1) C, (1) - (- -1) C, (- j) = 

-1- a^C, (— -i) Cj (y) + OiC, (— I) Ci (— -i)]. 


Equating the coefficients at the same spin functions on both sides 
of the equation, we obtain the equations for the parameter Xj and 
the coefficients 


Oi(l — X|) — 0, (z^Xi^O, flgXj^sO, 04 ( 1 -j-Xj) — 0 (24.57) 
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Similarly, using the relation 

(»'■ «'■) C = «.c, ( y C. (i) - a, [C. (1) C, (- 1) - 
-2C.(^i)c.{l)]-a,[c,(^i)c.(l)-2C,(i)c.{-i)] + 

+ «.c,(— i)c,(-i), (24.58) 

which can be obtained with the aid of (24.51), we obtain from (24.56) 

2 Ui = 

= a2 = (X2~- l)a3, (24.59) 

As usual we set X 2 = i;(.S+l), where 5 is the eigenvalue of the 
total spin. We note that the systems of equations (24.57) and 
(24.59) must be satisfied simultaneously. From these equations 
we find that the possible nonvanishing solutions have eigenvalues 

X 2 = 2, with Xi = + 1, 0, — 1 (triplet), 

X^ ~ 0, with Xj = 0 (singlet) . 

Consider now four cases. Case 1. Total spin is equal to unity and 
is directed along the z axis. The spins of both particles are par- 
allel. The corresponding solution, given by (24.52), is symmetric: 

(24.60) 

X2 = 2, 

Xi = 4-1, a^==ai = ai = 0. 

Case 2. Total spin is equal to unity and is antiparallel to the z 
axis. The spins of both particles are parallel. The corresponding 
symmetric solution is 


C* = (- -[) (- [) . II . (24.61) 

X2=r:2, 

Xj = — 1, ai = a2 = cii^=0. 

Case 3. Total spin is directed perpendicular to the z axis and is 
equal to unity. The spins of both particles are parallel. The cor- 
responding symmetric solution is 




^ Ir 

wr 


^ '' C 
2 r'^ 


2 )+^* (i)l' {r}/’ 


(24.62) 


= 


1 

Vi- 


X 2 — 0, 


a, = Oi = 0 
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Case 4* Total spin is equal to zero. The spins of the particles are 
antiparallel. The solution is antisymmetric: 

(!)• ( 24 . 63 ) 


Xj=0, a^ = aj = 0 

We note that the choice of the nonvanishing Clebsch-Gordan co- 
efficients was such that all four solutions are normalized to unity. 

Returning once again to the spatial part of the wave function for 
the helium atom and using the fact that the total solution must be 
antisymmetric we have, in accordance with (24.37) and (24.39), 

= C^ (si, s,)r (n, /-,) (24.64) 

(three states) and 

(si. 5,)f (r„ r,) (24.65) 

(one state). The spin functions are given by Eqs. (24.60)-(24.63), 
and the spatial part of the wave function for th is 

(24.66) 

In the case where both electrons are in the same state (/ii=n 2 )# 
there is only one solution with a symmetrical spatial part: 

' = C** (s„ 5,) f ; f = w == (n) (/-,). (24.67) 


E. PARAHELIUM AND ORTHOHELIUM 

We have obtained the wave functions that describe two states. 
One state (parahelium) is characterized by antiparallel spins of the 
electrons [the wave function (24.65), which is symmetric with 
respect to interchange of the coordinates]. The other state (ortho- 
helium) corresponds to parallel spins [the wave function (24.64), 
which is antisymmetric with respect to interchange of the co- 
ordinates; see Fig. 24,1]. A very interesting property of these 
states is that both types of helium are stable; in other words, 
parahelium and orthohelium are not converted into one another. 
We can convince ourselves that both systems are closed by direct 
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calculation. The matrix element of the dipole moment correspond- 
ing to a dipole transition from orthohelium to parahelium 


— ^ (ri,ri)d''x,d'xt= (24.68) 

= — ^ (n.''i)(n + ri)'P {/‘t,ri)d%d^Xi, 

turns out to be equal to zero, since 


^s-a — ^s-a — 


(24.69) 


[In (24.68) we have changed the 
variables of integration and used 
the symmetry properties of the 
wave functions.] Dipole transitions 
from one state to another are 
therefore forbidden. 

Mutual conversion of these 
states may, however, take place 
under the action of other parti- 
cles. For example, the bombardment of orthohelium by electrons 
may lead to the replacement of an ejected electron by another elec- 
tron with opposite spin, and thus orthohelium can be transformed 
into parahelium, and vice versa. 



r^g 24 1. Electron spin orientation 
in the helium atom 


F. ENERGY SPECTRUM OF THE HELIUM ATOM 

It has already been noted that the total orbital angular momentum 
L, which is a result of the addition of the orbital angular mo- 
menta lx and /.jOf two electrons, assumes integral values (Russell- 
Saunders coupling). In the particular case where = (both 
electrons are in the p state), the total orbital angular momentum 
takes the values L — 2, 1, 0. In terms of the vector model, these 
values correspond to the following situations: 

1. L = 2. The angular momenta are parallel; 

/if t /a h = lx-\-l^ = 2. 

2. L= 1. The angular momenta are at an angle of 60®; 
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3. L = 0 The angular momenta are antiparallel; 

/it 1/4 Z.=:/i~/a = 0 . 

In the general case where the number L takes all possible 

integral values 


L = — U ^i“t“^4 — 2, ... , /i — /a. (24.70) 

In denoting the energy levels for multielectron atoms, we follow 
the same rules as for hydrogen-like atoms, except that the states 
with a definite value ofthe orbital angular momentum L are denoted 
by capital letters 


L = 0 S state 

L = 1 P state 

L = 2 D state 

L = 3 F state, and so on. 

Let us enumerate the energy levels of the helium atom. The lowest 
level corresponds to the total orbital angular momentum L = 0 with 
both electrons in the s state. This singlet term of parahelium is 
denoted by 


{lsls)'S, 

and corresponds to an antiparallel spin orientation. The super- 
script at the upper left-hand side of the letter characterizing a 
particular term indicates the number of states (the multiplicity 
of states). 

The next term corresponds to the case when one of the electrons 
is in the Is state and the other is in the 2s state. In this case, both 
parahelium 


(ls2s)% 


and orthohelium 


(ls2s)35i 

are possible. Orthohelium, unlike parahelium, has a spin angular 
momentum and exhibits anomalous Zeeman splitting in a magnetic 
field. 

We note that the (ls2s)®5, level of orthohelium is metastable, 
because a transition to the lower energy state belonging to para- 
helium is forbidden by the selection rules. The levels of para- 
helium are singlets (spin 0), and those of orthohelium are triplets 
(spin 1). 
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Parahelium -g Orthohelium 

■g| l ifs2p) \ 




20 


18 

16 

/4 

12 

10 

8 

6 

4 


Js2^) 

z(!sJsL% 


The splitting can be easily explained in terms of the vector 
model of the orbital and spin angular momenta. According to the 
vector model, the sum of the two vectors L and S (that is, the 

total angular momentum vector) 
can take the values 

y==L + 5, L + 

... (24.71) 

For the total number of 

these values is 

2S+1, (24.72) 

whereas for L<^S it is 


E,ev, 

24 

22 

20,55 

19.77 




[ls2p) >P, 

snip) ‘"i 

Metastable [ts2t) •‘J, 
state 


2L+1. 


(24,73) 


Ground 

level 




h'lfi 24.2 Energy level diagram for the 
helium atom (the splitting of the 
level IS not in scale) 


It follows that for the 5 term 
(L=:0),the total number of pos- 
sible states for both ortho- 
helium and parahelium is equal 
to one. Inexactly the same way, 
the s term of the hydrogen 
atom (one valence electron) is a 
single, and the number of states 
is equal to two (that is, the levels are doublets) only for p, d terms 
(for which />s). 

The total spin 5 of helium assumes two values: 5 = 0,1. For 
5 = 0 the number of states (25 -f- 1) is equal to unity, and for 5 = 1 
it is equal to three (25+1=3); that is, the state is a triplet. A 
general diagram of the energy levels of the helium atom is given 
in Fig. 24.2. 

For elements of the third group (5 = |- and5 = - 2 ")> 

are quartets or doublets. Thus the number of valence electrons 
(see Chapter 25) completely determines the splitting of the spectral 
lines. 

The theory of multielectron atoms is treated in several books. ^ 

A particle is In the Yukawa potential well 




^See, for example, D. I. Blokhintsev, rrinciplcs of Quantum Mechanics (trans.), Allyr 
and Bacon, 1964. 

^These problems refer to Chapters 23 and 24 



THE MULTI ELECTRON ATOMS INCLUDING SPIN STATES 


395 


Choose a test function in the form = and determine the upper limit of the 

values of >1 at which at least one discrete level will exisL Using the obtained solution, 
consider the special case of a Coulomb potential ^ = 0). Show that discrete 

levels always exist for the Coulomb potential. Determine the lowest energy level and the 
corresponding wave function. 

Solution, The normalized wave function is of the form [see (23,47)]: 

Yk 

The average values of the kinetic and potenaal energies are [see (23,52), (23,53)] 




The appearance of the first discrete level Is possible under the conditions 


Ei~T+ P = 0, 




from which we find 


^max — '• 

Discrete levels exist for 7: < 0. This happens when /c < ^niax* ® Coulomb field 
y. =0, and therefore this condition is always satisfied, 'Hie average energy in the Coulomb 
case is 


' 2m, 


Using the condition that Ei must be minimum, we find 

Pi 


^0 


where Oq- 






is the radius of the first Bohr orbit. The expressions for the wave func- 


tion and the average energy will be exactly the same as those yielded by the Schr'odlnger 
equation for the Is level of the hydrogen atom (see Chapter 13), 

Problem 24,^ Calculate the diamagnetic susceptibility of the ground state of one 
gram-al"5W "W '[^rahelium, using the expression (23,47) obtained for the wave function by 
the variational method. 

Solution, In the ground state of parahelium, the orbital and spin angular momenta are 
equal to zero(/=:0), and therefore parahelium must be diamagnetic (see Problem 20,1). 

In accordance with (20,48), the diamagnetic susceptibility per gram-atom is given by 
the expression 


•/=_ r^), 

where N Is Avogadro*s number and 

/I + ri = ^ (rt + r|) | -|/ (r,. r,) |= d'x, d'x,. 
From (23.47) we obtain an approximate expression for (ri, r.) 


(24,74) 


(24.75) 


^{rur.) = 


^.3 

T.a\ 


a— Z'iri -1- r-l/rt! 
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where for helium 



27 

16* 


Evaluating the integral (24,75) with the above expression for we obtain 



The diamagnetic susceptibility is therefore 

•/theor =— 1 67 • lO ®, 

which is in fairly good agreement with the experimental value 

=_ 1.90 . I0-". 



(lhapler 25 


Optical Spectra of Alkali Metals 


A. GENERAL DISCUSSION OF THE STRUCTURE 
OF COMPLEX ATOMS 

It has already been pointed out that the position of an atom in 
the periodic table of elements is determined by the atomic number 
Z. This number characterizes the nuclear charge in units of e©; 
that is, it is an integer that is equal to the number of protons in 
the nucleus and also to the number of electrons in the neutral 
atom. Thus, for example, sodium has 11 electrons (Z=ll) and 
uranium has 92 (Z = 92). 

The most important questions in the theory of a complex atom 
concern the electron density distribution and the energy spectrum. 
In studying these questions it is necessary to consider the mutual 
interaction between all the electrons, in addition to the attraction 
between the electrons and the nucleus. Owing to the interelectronic 
interaction, the energy values of the electrons will be smaller (in 
absolute value) than those given by the well-known expression 


/JRh 

n- ' 


Just as in the hydrogen atom, each electron in a complex atom 
is characterized by four quantum numbers. For Russell-Saunders 
coupling (the case where the spins and orbital angular momenta 
of the individual electrons in a given shell are added separately), 
these quantum numbers are as follows: 

(1) the principal quantum number n = 1, 2, 3, 4, ... , 

(2) the orbital angular momentum quantum number I =0, 1, 2, 
. . . , (n — l). 

(3) the magnetic quantum number //2 = 0, i 1, . . . , ± /, and 

(4) the spin quantum number m = i 1/2 (characterizing the 
projection of the spin on the z axis). 

In the case of jj coupling, the four quantum numbers are as 
follows: 

(1) the principal quantum number n , 

(2) the orbital angular momentum quantum number / , 

( 3) the total angular momentum quantum number / = 1 / ih '/a | , and 
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( 4 ) /ny = — /, — /-f 1 . . . / — 1, / (characterizing the component 
of the total angular momentum along the z axis). 

As we have already said, Russell-Saunders coupling (or LS 
coupling) is characteristic of light elements, while jj coupling is 
characteristic of heavy elements. Both types of coupling give the 
same number of levels. 

A group of closely spaced energy levels, which is separated 
from other energy levels by an appreciable gap, forms a shell 
made up of several subshells. Just as for the hydrogen atom, the 
classification of the shells is based, as a rule, on the principal 
quantum number n. The relationship between the principal quantum 
number n and the letters which are used to denote the shells in 
x-ray spectroscopy is as follows: 


n 

1 

2 

3 

4 

5 

6 
7 


Shell symbol 

K 

L 

M 

N 

0 

P 

Q 


Within the shells, electrons with values of the orbital quantum 
number / equal to 0, 1, 2, 3, . . . form the s, p, cf, , sub- 
shells. To determine how the shells and subshells are filled, one 
should take into account the Pauli exclusion principle, according 
to which there can be only one electron in each quantum state 
characterized by four quantum numbers. In a state with fixed 
values of n, /, m, there can be at most two electrons, differing in 
spin direction. Since the quantum number which varies from 
-/to +/, can take 2 / + i values, we obtain the following expression 
for the maximum number of electrons in a given subshell: 


N,, = Ni = 2(2l + l). (25.1) 


^The same value for /V/ is found for 77 coupling For given values of the three quantum 
numbers n, I, j, the fourth quantum number may take the values —j, .. , —Vi, V^i, , 7, 
corresponding to 27 +• 1 states Hence we obtain the same value for N; 

+ ») - 2(2/ ( 1) , 

; 


where 
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It follows that the maximum number of electrons In the subshells 
s(/ = 0), p(/ = l), d{l = 2) and /(/ = 3) Is as follows: 

N, = 2. A/p = 6. N, = \0, Nf = H. 

Subshells with a larger value of I are not encountered in unexcited 
atoms, and therefore we shall not discuss them here. 

We can now find the maximum number of electrons which can 
occupy a given shell 


n — 1 


N 


„ = 2 A/, = 2(l+3+... + (2rt-l)) = 2n (25.2) 


/=0 


Thus there can be at most two electrons in the K shell, eight 
electrons in the L shell, eighteen electrons in the M shell, 32 
electrons in the N shell, and so on. It should be mentioned that 
these relations between the quant\im number and the number of 
states in the various shells are true, generally speaking, only for 
hydrogen- like atoms. 

To establish the order in which the shells and subshells are 
filled in complex atoms, it is necessary to consider the mutual 
interaction of the electrons. As we have emphasized several 
times, this problem cannot be solved exactly. The application of 
quantum mechanics to multielectron systems, however, has led to 
the discovery of various important properties of such systems, 
including, in particular, the exchange interaction. Moreover, the 
development of approximation methods has made it possible to ob- 
tain highly satisfactory quantitative results for complex atoms. 
As indicated in Chapter 23, the simplest approximation methods 
are the variational methods developed by Rltz, Hylleraas, and 
others; these give comparatively good results for the light atoms 
(up to potassium). A more complete analysis of the structure of 
an atom can be made by means of the self-consistent field method 
developed by Hartree and Fock. This method has been used to 
determine the energy and the electron distribution in heavy and 
light atoms (see Chapter 23). This method also gives the shell 
structure of complex atoms. ^ 

Unfortunately, the use of the self-consistent field involves some 
very complicated computations, which can be carried out only by 
computers. As a result, the eigenfunctions characterizing the 
electron distribution are obtained in the form of numerical tables 
rather than analytical expressions. 


^For more details see Hartree, D R The Calculation of Atomic Structures, New York: 
John Wiley & Sons, Inc., 1957. 
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B. THE THOMAS- FERMI STATISTICAL METHOD 


Besides the approximation methods, there are statistical 
methods that have been introduced on the basis of principles 
developed by Thomas and Fermi. These methods apply mainly to 
heavy atoms. 

In the statistical treatment, the electrons of an atom are re- 
garded as a degenerate gas with T = 0, just as in the theory of 
metals (see Chapter 6). The Thomas-Fermi statistical method is, 
of course, less accurate than the Hartree-Fock self-consistent 
field method, because many features of the behavior of the in- 
dividual electrons are neglected. In spite of this general short- 
coming, the Thomas-Fermi method provides a fairly simple 
explanation of many important properties which are characteristic 
for the average behavior of electrons in an atom. 

The Thomas-Fermi method does not enable us to find the 
shell structure of an atom, but it does explain several important 
features of the filling of the electron shells. This method can be 
used for calculating the total bonding energy of electrons in an 
atom and the radii of atoms and ions. It can also be used to deter- 
mine the influence of screening electron shells on the scattering 
of fast electrons, bremsstrahlung, and the creation of electron- 
positron pairs due to absorption of x-ray photons in the nuclear 
field. 

Presently statistical methods are being successfully used to 
construct a theory of heavy nuclei and a theory of matter at high 
pressures (for example, inside stars). 


In heavy atoms the positively charged nucleus is surrounded by a cloud of negatively 
charged electrons, which partially screen the electric charge of the nucleus. The po- 
tential of an ionized atom at distances greater than its radius is given in the first 
approximation by the following expression: 


'I’ClT 


(Z-N)eo 

r 


(25.3) 


where Z is the atomic number, and N is the number of electrons. For a neutral atom 
N~Z and, therefore, (]>oo = 0; that is, the electrons completely screen the charge 
of the nucleus. 

In constructing a statistical theory, three forms of energy should be taken Into 
account. 

1. The electrostatic energy of attraction between the electrons and the nucleus. 
This energy is related to the electron density po (the number of electrons per unit 
volume) by the expression 


Fn-e — ^0 ^ Po^^n (25.4) 

where e~ — is the charge of the electron, and ^ is the potential produced by 
the nucleus. 
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2* The electrostatic energy of repulsion between the electrons 
Ve-e=-j J Po‘»e 


(2S.4a) 


where 


<De (r) — J 

3, The kinetic energy of the electrons in the atom. Just as in the theory of solids at 
absolute zero, the average kinetic energy of an Individual electron is related to the 
electron density Po by the following expression, in accordance with Eqs, (6.33) and 
(6.338)^: 

7'av = >'Po (25.4b) 


where 


X 




Hence the kinetic energy of the electrons is 


(25.5) 


r=x 



(25.6) 


The total energy of an electron gas in the field of a nucleus is equal to the sum of the 
potential energy, which consists of two parts [see (25,4) and (25.4a)], and the kinetic 
energy [see (25.6)]. Thus the total energy is 


= x J 


£=7-+K„.e+l/e-c = 


(25.7) 


The density of the electron gas must satisfy the condition 


s 


Po 


(25.8) 


where N is the number of electrons in the atom. 

With the auxiliary condition (25.8), the variational principle can be formulated as 
follows: 


B{£’ + ^o^oA^}=0. (25.9) 

Using this principle, we can find a relationship between the total potential $ = <I>n + 
and the density of electrons po 


Po 




(25.10) 


3 

These formulas were obtained on the assumption that at most two electrons can occupy 
each quantum state characterized by three quantum numbers. Thus the Thomas-Fermi 
statistical theory automatically incorporates the Pauli exclusion principle, which is of 
fundamental importance in the theory of complex atoms 
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where the Lagrangian multiplier (I>fl , which plays the role of a constant potential, must 
be found from the boundary conditions. In the derivation of (25, 10), we have used the 
relations 


8 ^ J p.VaBporf'x, 

8 jpo'I>n(/’A:= ^ 

1 1 J 

‘1 J [? ?»(»“) Po (r’)+P._(yp , (r')] d>xd>x ' ^ ^ sj f,a>x^iN. 


(25. H) 


Substituting Eq, (25.10) for the electron density into Poisson's equation for a spheri- 
cally symmetric electron distribution 


z= J- = 4neoPo (25. 12) 

r dr* 

and recalling that <I>o = const, we obtain the Thomas-Ferml equation which forms the 
basis of the statistical model of the atom 

1 r (<!> - Oo) = (1m,e,Yh (O - $„)’/». (25. 13) 

When a specific problem is investigated, the solution of Eq, (25,13) must satisfy 
certaip definite boundary conditions. For an atom the boundary conditions can be given 
in the form 


and 




Zcq 


for r — 0 


(25.14) 


, (Z—N)eo. (25.15) 

LJi for r = ro. 

f 0 

Here r© is determined from the condition that the electron density should be equal to 
zero at r = r© f that Is, p© (r©) == 0, Hence, in accordance with (25, 10), we find 


^0 


(Z-N)eo 

ro 


(25,16) 


With the help of Poisson's equation (25,12) and (25, 13), the condition (25,8) may be 
written as 


\r—%zr^‘lr = Nei, (25.17) 

0*^ 

It follows from (25.16) that ^»© = 0 and r© = oo for a neutral atom (N^Z), Instead of 
(25,17) we get, therefore. 


oo 



dr = Z^o. 


This condition will hold if the second boundary condition is satisfied 


lim r^ = 0. 

r-*oo 


(25,18) 
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We note that the lliomas- Fermi equation (25, 13) has one exact solution 


81 ^ __ 
^mlel 


(25.19) 


as can be easily shown by substituting (25.19) into (25.13), 

This solution for a neutral atom (^>0 = 0) will satisfy one of the boundary conditions 
for r-^Qo [see (25.18)], The second boundary condition for r-*0 [see (25. 14)], however, 
will not be satisfied. 

Unfortunately, solutions of the Thomas-Ferml equation which satisfy both boundax 7 
conditions cannot be expressed in analytic form,^ 

Substituting (25, 14) into (25. 10), we find the behavior of the density po as r 0, 

po = const r” (25.20) 


The solution (25.19) for a neutral atom gives too high a value for ^ asr— * 00 , 
The more exact Hartree-Fock method shows that the electron density will decrease 
exponentially as r co . 

Since we are interested only in the basic principles of the statistical method, we shall 
construct an approximate statistical theory of the atom with the help of the variational 
method. This will enable us to formulate a solution of the problem in analytic form with 
only small quantitative deviations which are of no Importance to us. 


C. SOLUTION OF THE THOMAS-FERMI PROBLEM 
BY THE RITZ VARIATIONAL METHOD 


Solving a problem by the Ritz variational method, one can propose any number of 
test functions which depend on the different values of the variational parameter X. 

We shall choose the test function on the basis of the following considerations. The 
function should agree roughly with the solution of the Thomas-Fermi equation for 


may note that the numerical integration of this equation presents two advantages 
over the numerical integration of the Hartree-Fock equations First, the Thomas-Fermi 
equation is much simpler than the Hartree*Fock equations Second, this equation and the 
boundary conditions (for example, - 0 for a neutral atom with Z - N) can be transformed 
into a universal form which is independent of Z To do this, we must replace $(r) by a 
new function 

^(r) -- , 

T 


where 



With the function f{x),Eq. (25.13) becomes 



(25 13a) 


and the boundary conditions (25 14) and (25 18) become 

f(x) ^1 for X -» 0 , f{x) = 0 for x-»00. (25 14a) 

These equations are of a universal character, that is, they do not necessarily depend on 
the quantity Z. Therefore, after numerically integrating the Thomas-Fermi equation, we 
can change the variable (which depends on Z) and use the equation to investigate any 
heavy atom. This cannot be done with the Hartree-Fock equation. 
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r 0 (this region is the most important with regard to the solution of the problem as a 
whole), and it should have a comparatively simple form, so that it can be exactly in?- 
tegrated in the calculation of the total energy. One test function which satisfied tihese 
requirements is 


po = 




(25,21) 


This function is already normalized to the total number of electrons 


/» 

1 ^ — I ==//, 


(25,21a) 


and therefore the auxiliary condition (25,8) will be satisfied automatically. 

For r— 0, the test function (25.21) changes in the same way (po r'”®''-) as the 
solution of the Thomas-Ferml equation [see (25,20)], As we shall see later, this ex- 
plains the good quantitative agreement between results obtained by means of the test 
function (25.21) and those obtained by means of the potential satisfying the Thomas- 
Fermi equation. 

The potential due to the electrons in the atom is 

'^e = - ( 1 - f ~ (25.22) 

This can be easily shown by substituting Eqs. (25.21) and (25,22) for po and into the 
equation 


_ 4jteQPQ 


Using also the expression potential satisfies the 

boundary condition (25,15) for r = ro~^oo, when the charge density vanishes together 
with the exponential term . 

We can also find an expression for the kinetic energy T in terms of the variational 
parameter X. From Eqs, (25,6) and (25.21) we have 


T == 47tx 



9 

'■]7r '^'’“400 






(25.23) 


We have the following expressions for the potential energy due to the interaction 
between the nucleus and the electrons [see (25,4)] and for the potential due to the mutual 
interaction of the electrons [see (25,4a)]: 


Fn-e=- 


ZNe;, 


xv. 


Vr 


dr= — 


ZNel\ 

2 ’ 


(25.24) 


OO 



l^el\ 

16 * 


(25.25) 
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Adding Eqs. (25, 23 >-(25.25), we find that the total energy (25.7) of the electron cloud is 




(25.26) 




(25.27) 


The variational parameter X , corresponding to the reciprocal of the effective radius 
of the atom, can found from the condition for the minimum of the total energy /: of 

dE ^ 


the atom, that Is, 


: 0. Hence we find 


I 9 I'HnY/s 


'eff — I— 100 V 2 


(25.28) 


._-K— g . 


For a neutral atom (N — Z), we have 


/?eff • "^1/7’ 


1 ? £? yVi 

9 64 \3tc/ Go 


-0.758.. -'^Z' 
do 


(25.30) 


It Is worth noting that numerical Integration of the Thomas-Ferml equation leads to a 
value for the energy of the atom which is very close to that given by (25.30) 


t T-F = —0,709 Z’''’ = — 20.94 ev. 
«0 


(25.30a) 


The expression (25,30a) (with the sign reversed) represents the total bonding energy 
(or ionization energy) of a neutral atom, that is, the energy required to remove all 
the electrons from the atom. 

The theoretical values obtained for W from (25,30a) are quite reasonable even in the 
case of the hydrogen atom, but they are somewhat higher than the corresponding experi- 
mental values. The relative error decreases with Increasing Z (see Table 25.1), 


Table 25.1 

Theoretical and experimental values of the total ionization 


energy W of atoms (in units of eo^/ao) 
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Concluding this section, let us compare the curves which are obtained for the charge 
distribution in neutral argon (Z=18) on the basis of the Thomas-Fermi statistical 
theory and on the basis of Hartree*s method of the self-consistent field (see Fig* 25,1). 
It can be seen from the graph that the curve of po computed according to the Hartree 
method (Curve B) has characteristic maxima and minima corresponding to the electron 
shells, whereas the curve calculated according to the Thomas-Fermi statistical theory 
(Curve A) describes only the average behavior of the electron density, and therefore has 
no relative maxima. For large values of r there is also a marked lack of agreement 
between the two curves: the Hartree-Fock method gives functions which decrease 
exponentially with increasing r , whereas in the statistical theory the decrease is pro- 
portional to r"* [see (25.19)].s 



Fig 25.1. Comparison of the electron density distri- 
butions in argon (Z = 18) obtained on the basis of the 
Thomas-Fermi method (Curve A) and the Hartree-Fock 
method (Curve B) A pseudologarithmic scale is used 
for the ordinate axis (we have plotted the quantity 
£'n(l f Dqq), where D - Pq) Therefore, the graph 
will be linear for small Oqq, and logarithmic for large 
Dqq The choice of this scale enables us to follow 
the variation of Pq at large values of r, as well as for 
r < Oq 


D. ENERGY LEVELS OF ALKALI METAL ATOMS 

In studying spectral lines in complex atoms, it is necessary to 
distinguish between outer and inner shells. 

In the hydrogen atom there is only an outer shell (the K shell), 
which contains one electron. In helium (Z=:2), the /C shell is 
completely filled (a noble or inert gas). In lithium (an alkali metal 
in Group I of the periodic table, Z = 3), the inner shell (K shell) is 
completely filled, but the outer L shell has only one electron. 
The filling of the L shell is completed in Ne(Z — 10). In sodium 
(an alkali metal, (Z = 11) , the inner K and L shells are completely 


A detailed exposition of the statistical theory of an atom is given in P. Gombas, Die 
statisti^che Theorie des Atoms und ihre Anwendimgen, Vienna* Springer- Verlag, 1949. 
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filled, but there is one electron in the outer M shell. The filling of 
the shells in these atoms is illustrated in Fig. 25.2. 

It should be noted that the bonding energy of an electron in an 
inner shell is much greater that that of an electron in an outer 
shell. An indication of this is provided by the ionization energy, 
which is more than 20 ev for inert gases, whereas for alkali metals 
it is only slightly more than 5 ev. The removal of the first valence 
electron in lithium requires the expenditure of 5.39 ev, but re- 
moval of the second and third electrons from the inner shells re- 
quires the expenditure of 76 and 122 ev, respectively. 



Fig. 25.2. Diagram of the filling of electron 
shells in different atoms. On the right, atoms 
with completely filled shells (inert gases) The 
dark spots represent electrons and the light 
spots (with a plus sign in the middle) represent 
nuclei 


The atoms in Group I of the periodic table (Li, Na, K, Rb, Cs, 
etc.) are known as alkali metals; each has an outer shell containing 
one electron, Just like the hydrogen atom. Their optical and 
chemical properties, therefore, should be essentially the same as 
those of the hydrogen atom (for example, all of these elements are 
monovalent and they all exhibit doublet splitting of the spectral 
terms). 

The optical spectrum is caused by the transition of a valence 
electron (that is, an electron in the outer shell) from an excited 
state to a lower state. The excitation of electrons in the inner 
shells generally requires considerably more energy than does that 
of electrons in the outer shells ; therefore, downward transitions from 
excited states to the ground states of the inner shell are accompanied 
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by the emission of x-rays (see Chapter 26). The atomic nucleus 
and the electrons in the inner shells together form what is known 
as the atomic core. Thus the charge of the atomic core is equal 
to Z^ — Z — N, where N is the number of electrons in the inner 
shells. For alkali metals (Li, Na, etc.) N = Z-~\, and thus the 
charge of the atomic core is equal to unity (2,,= 1). Therefore, 
the main part of the potential energy which retains the outer 
electron in the metal is the same as for hydrogen, namely, 

= = (25.31) 

The analysis of the spectra of alkali metals can thus be based on 
the corresponding expression for the energy which was obtained 
in Chapter 13 


= (25.32) 

Similarly, we can use the hydrogen wave functions as the zero- 
order approximation of the wave functions 

W (25.33) 

In alkali metals, however, we cannot confine ourselves to the 
Coulomb energy in treating the interaction between the valence 
electron and the atomic core, and we must also take into account 
the polarization forces and the “smearing^* of the atomic core 
over a finite volume. This yields various corrections and removes 
the / degeneracy which occurs in hydrogen. 

In Bohr^s semiclassical theory, the orbits of valence electrons 
were rigorously divided into orbits which penetrate the atomic 
core, and those which do not. In the case of ‘‘nonpenetrating’* 
orbits (orbits with nearly circular trajectories) we need to con- 
sider only the polarization forces, since the potential outside the 
atomic core (that is, outside the inner shells) is completely 
independent of the radial distribution of a spherically symmetric 
charge. The radial distribution is very important only for “pen- 
etrating** orbits (elongated ellipses) (see Fig. 25.3). 

In quantum theory the concept of a trajectory is no longer 
meaningful. In order to classify orbits as ‘‘nonpenetrating” or 
“penetrating,” we must introduce a new convention; namely, if 
the wave function describing the behavior of the valence electron 
can be set equal to zero inside the atomic core, the orbit is 
“nonpenetrating,’" and if on the contrary the wave Unction can- 
not be set equal to zero inside the atomic core, the orbit is 
“penetrating.” 
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It should be noted in this connection that the s orbit of a complex 
atom is always penetrating, since its wave function differs from 
zero inside the atomic core and, indeed, even inside the nucleus 


16 ( 0 ) 1 ^ 




( 25 . 34 ) 


We shall now calculate thepolarizationforces that arise between 
an outer electron and the atomic core. The outer electron will 
obviously repel electrons in the inner shells and attract the nucleus. 
As a consequence, the atomic core is polarized, and a polarization 
force arises between it and the outer electron 


Fp = — (Z—\)ei 


1 1 ' 
(r + .x)=. 


2ei (Z-])x 


( 25 . 35 ) 


The quantity e(^{Z—\)x=^p represents the polarization of the 
atomic core. 

If we regard the atomic core as 
an elastic dipole, we can set 


= ( 25 . 36 ) 

where p is the polarizability of the 
atom, and 

8 = ^" ( 25 . 37 ) 

F’lg. 25 3 . “Nonpenetrating* and “pene- 
trating" orbits m alkali metal atoms 

is the electric field produced by the 
outer electron at the center of the 

atomic core. Thus we can obtain the following expression for the 
potential energy associated with the polarization^: 







cw 



2r» • 


( 25 . 38 ) 


Since the polarization can be regarded as a perturbation in this 
problem, we have the following expression for the polarization 
energy: 





( 25 . 39 ) 


The coefficient of polarization P is usually determined from semiempirical formulas 
Its numerical values for alkali metal ions (cores) are as follows (in units of lO”^"* cm‘^): 


0 03(Li+), 0 19(Na+), 0 89(K+), 1.50(Rb+), 2 60(Cs+) 
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Since, according to (13-29a), 




/(/+!) 

3n> 


relation (25.39) may be reduced to 


A p 

2aon»’ 


(25.40) 


where 


8 = 8 , — ^. 
* n*’ 


8 ,: 


4aJ (/_!)/(/+ 1) (, + 1) + 1) ‘ (25.41) 


8 _ '(/+i)8 

* 4 — — 3 „»— «>»• 


We can, therefore, find the total energy, which in this case depends 
upon both I and n (for the time being, we are neglecting spin 
corrections) 


£«(==■ 


Rn 


f-A£, 


P- 


Substituting expression (25.40) for ACp and using the relation 

Rn __ 

“"‘2aoV’ 


we obtain 


since 



^ 25 
‘2ao 


2<2o (4 — 5)2 


(25.42) 


(/I — 5)2 «2 y "T ,j8- 

Introducing the “effective principal quantum number^ ==n — S, 
we get 
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It should be noted that Eq. (25.41) cannot be used for s states, 
since the coefficient Sj becomes infinite when 1 = 0. This happens 
because polarization forces can be introduced meaningfully only 
in the case where the outer electron is at a sufficiently great 
distance from the atomic core. For s terms the wave function does 
not vanish even when /• = 0 [see (25.34)]. 

The influence of the inner electrons on the s orbits (penetrating) 
is due mainly to the ‘‘smearing’* of the electronic cloud of the atomic 
core. In general, the additional energy due to the “smearing” of 
the electrons over the volume of the atomic core is given by 

Aevoi= 5 1 1* (r) r V,oid% (25.44) 

where l^voi is the difference between the potential energy produced 
by the electrons of the atomic core, taking into account their 
distribution over a finite volume, and the potential energy pro- 
duced by an equivalent charge concentrated at the center. 

To estimate the order of magnitude of the correction S for s 
terms, let us assume that the Z— 1 electrons of the inner shells 
fill uniformly a volume of radius R. We then have 


Ivol — 


r V 2/? + 2/^«J* 


(25.45) 


Replacing the wave functionfor the s states by its value at the origin 
[see (25.34)], we find the following approximate expression for the 
additional energy of the s terms’: 


where 




2 ZelR^^ 
5 


2fly ’ 


(25.46) 


5 al ‘ 


This expression no longer diverges. 


(25.47) 


’As a rule, the correction S for penetrating orbits is taken into account in the following 
manner It is assumed that the orbit consists of two parts: an outer part and an inner one 
In the outer part of the orbit the electron is acted upon by a point charge t?o^a alkali 
metals Zg ^ 1), in the inner orbit, the electron experiences the effect of point charge CqZ, 
which should be greater than because the screening effect of the inner electrons is 

reduced. It is difficult to determine the charge Z|, theoretically, and it is usually regarded 


as an empirical parameter. 

We thus obtain the following equation for ^ 




ao 


+ 1 ) 




(25 43 ) 


where a is the maximum distance from the nucleus to the electron in the inner ellipsoid 
(that IS, the radius of the atomic core) It is easily shown that the quantity 8^ decreases 
rapidly with increasing /. A particularly good value of 6^ is obtained for s terms. 
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We may note here that according to the Thomas- Fermi model, 
the radius of an atom [see (25.28)] is 

(25.48) 

where the coefficient t > which characterizes the distribution of the 
charge inside the atom, is of the order of unity. 

Consequently, for the total energy of an electron in the case of 
“penetrating” s orbits, we again obtain an equation of the same 
form as (25.42) 

^ n ,/=0 = ’ (25.49) 

where — 8, but now 8 is given by (25.47). 

In order to analyze the difference between the corrections 
for “penetrating” and “nonpenetrating” orbits, we shall take as 
an example the Li atom. 

In this atom the p orbit (/ — l)is nonpenetrating. Equation (25.41) 
gives the value 8^^0.04 for the lowest state (n = 2). According 
to (25.47), the corresponding expression for 8^ for penetrating s 
orbits must be one order of magnitude greater. 

It should be noted that for increasing n and /= const, the 
eccentricity approaches unity; that is, the elliptical “orbits” 
become more and more elongated: 


(25.50) 

In heavy nuclei, therefore, there will also be penetrating orbits 
with / = 0 as well as those with larger values of /. This is re- 
flected in the corrections 5 for the alkali metal spectra, whose 
values are given in Table 25.2. 


Table 25.2 


Corrections 5 for the spectra of alkali metals ^ 


z 

Element 

a. 


Sc 

s, 

1 

il 

0.000 

0.000 

0.000 

0.000 

3 

Li 

0 412* 

0.041 

0.002 

0.000 

11 

Na 

1.373* 

0.883'' 

0.010 

0.00 1 

19 

K 

2.230* 

1.776* 

0.I4G* 

0.007 

37 

Rb 

3.195* 

2.711* 

1 233* 

0 012 

55 

Cs 

4.131* 

3.649* 

2.448* 

0.022 


The values of S for penetrating orbits are indicated by an asterisk 
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It follows from Eq. (25.49) and also from the Table 25.2 that, 
for a given n , the largest downward shift (that is, in the direction 
of decreasing energy) associated with the “smearing” of the inner 
electrons over a finite volume occurs in states with the lowest /. 
In other words, the largest shift occurs for s terms. 

The hydrogen atom alone has no penetrating orbits. For the Li 
atom (Z = 3; that is, the next element after hydrogen in the first 
group of the periodic table), the only penetrating orbit is the outer 
s orbit. In the next alkali metal, namely, the Na atom (Z=ll), the 
s and p orbits are penetrating. 


E. FUNDAMENTAL SERIES 

The energy levels of the hydrogen atom without relativistic 
corrections are given by the well-known relation 



From this we can find the values of the spectral terms 

= = 

(2s) = (2/)) = ^ = f , (25.52) 

(3s) = (3p) = (3rf) = | = 4. 

It follows that the states of hydrogen are degenerate with respect 
to both / and m . A schematic diagram of the energy levels in the 
hydrogen atom is given in Fig. 25.4. 

In the Li atom, the energy levels of the K shell (/i== I) are filled 
(see PTg. 25.2), and therefore it is the L shell which is the outer one. 
Table 25.2 shows that the K shell exerts the strongest influence on 
s terms, and for the corresponding terms we have 

__ 0,412)J — _ I 0 588)>' 

This shift is so large that it was difficult to determine experi- 
mentally whether it belonged to n states or n --\ states. In order 


^In Chapter 21, when the finite size of the nucleus was taken into account, it was also 
found that the largest shift occurs for s terms. In that case, however, it was the positive 
charge that was spread over a finite volume, and therefore the terms were shifted upwards 
and not downwards 
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to make the term notation resemble that of hydrogen, spectros- 
copists have originally attributed the shift to the n — 1 state 

= = = (25.52a) 

where n*=n—\ and s= 1—8, =0.588 . We shall use an asterisk 
for the original notation (n*s) in order to distinguish it from the 
correct one (ns) . 



Fig. 25.4. Energy -level diagram of monovalent 
atoms The potential is usually measured (in 
pu) from the lowest level upwards Here, 
however, we wish to compare the energy 
levels of different atoms and, therefore, we 
have taken the potential at infinity as the 
zero level 


The shift of the other terms of lithium (/=1, 2) relative to the 
corresponding terms of hydrogen is negligible 

(np) = (nV)— 


(nd) = {n*d)- 


R 


(n + dj^ ' 


'®If the principal quantum number n in lithium assumes the values n = 2, 3, 4 (the term 
n = 1 IS occupied by two electrons and forms an inner shell), than the quantum number n* 
takes the values n* - I, 2, 3 ••• • 
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where 


p = — = — 0.041. 

d=~^t=~om2. 

The shell to which a shift belonged could be, therefore, uniquely 
determined and in the old notation the p, and other terms were 
placed in shells which were later shown to be theoretically correct 
(that is, n* ==n i see Fig* 25«4). 

In the next alkali metal, Na (Z— 11), the inner (filled) shells are 
the K and L shells (see Fig. 25.2). As can be seen from Table 25.2, 
the inner shells of Na have a pronounced influence on both the s 
terms and the p terms. The original notation which was used for 
the s terms was 

R R 

{ns) = (n*s) = 1 373 )# = o76^* » (25.53) 

where — 2, and s=-— + 0.627. Thus, spectroscopists 

had originally assumed values of the principal quantum number for 
the s terms of sodium which were low by two units,^^ 

The original notation that was used for the p terms was 

= _|„p)s > (25.54) 

where n*=n— 1, p=l — 8^ = 0.117. Thus, the principal quantum 
number for the p terms was reduced by one unit. The corrections 
for the states c/, /, . . . were negligible, and these terms had been 
assigned to the shells that were later obtained from the theory. 
The energy-level diagram of Na is given in Fig. 25.4. 

The spectral series of alkali metals are as follows. 

1. The principal series. The variable term is the p (principal) 
term. The spectral frequencies in this series are given by 


(o = (l*s) — (n*p), 


which means 


for H:(ls)-- (np), 

for Li:(25) — (/zp), (25.55) 

for Na (35) — (np). 


present, spectroscopists have also adopted a notation which follows from theoreti- 
cal calculations This is the notation used in Fig. 25 4 
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2. The sharp series. The variable term is the s (sharp) term. 
The spectral frequencies are given by 

a)=:(2*p)~(n*s), 


which means 



for H:(2p) — («s), 
for Li : (2p) — (ns), 
for Na:(3/7) — (ns). 

(25.56) 

3. The diffuse series. The variable term is the J (diffuse) term. 
The spectral frequencies are given by 



(25.57) 

4. 

The fundamental series. 



u>=.(3*d) — inVh 

(25.58) 


The variable term is the f (fundamental) term. 

These series take into account the selection rule 


1 . 

The names of the series partly reflect the character of their 
multiplet structure. 


F. MULTIPLET STRUCTURE OF THE SPECTRAL LINES 


Just as in the hydrogen atom, the multiplet structure of the 
spectral lines of alkali metals is due to the spin and relativistic 
effects. To find the splitting of the terms, let us use a relation 
including both the relativistic and spin-orbit corrections for a 
hydrogen-like atom [see (20.18)] 


I n 

^ U+Vs 4j’ 


(25.59) 


where a = el/nc= 1/137 is the fine-structure constant. 

To account for theeffectof electrons from the inner shells of al- 
kali metals, we simply replace Z by a certain effective value <z. 




(tTv. 



(25.60) 
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For “nonpenetrating” orbits, we may set Zgff = 1, because all 
the Z~1 electrons will screen the positive charge of the nucleus. 
For “penetrating” orbits the best value of Zeff is chosen from a 
comparison with experiment. 



AcOg > •= AtUj = A cv^ 

Principal series, con- Sharp series, equl- 
verglng doublets distant doublets 


Fig 25.5. Spectra of alkali metals 

Since the total angular momentum quantum number / assumes 
the values 


and 



for /=o 


j = for/7^0, 


we may conclude that all spectral terms of alkali metals should be 
doublets, except for the s term, where there is no splitting. 

In order to find the magnitude of the splitting, let us calculate 
the value of the spectral terms for two cases? first, when the spin 
and the orbital angular momentum are parallel 


1/2 _ ( n 3 \ 

n ~ n* \/ + i 4r 


(25.61) 


and, second, when they are antiparallel 

- i/j f n 3 \ 

n ~ n* \l 4j 


(25.62) 
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The splitting of the terms is equal to the difference between (25.62) 

and (25.61) 12 




««/(/+!)• 


(25.63) 


We shall use this equation to explain the doublet splitting of the 
principal series, that is, transitions originating from the p levels. 
Setting / = 1 we find 


Ao)^ = 


2n^ 


(25.64) 


It is evident now that the decrease in splitting will be inversely pro- 
portional to the cube of the principal quantum number n (see Fig. 
25.5); thatis, the spectral of the mam series are narrowing doublets. 

In the sharp series, the initial 
level is a doublet and the final level 
(with variable s) is a singlet (see 
Fig. 25.5). The distance between the 
doublets in the sharp series is con- 
stant (equidistant doublets) 



Au)2 = Au) 3 = . , , = Ao)^ = 

16 

In particular, for the spectrum of 
Li we have Zgff =1, and«* = 2(the 
p orbit is “nonpenetrating’’), so that 
this constant splitting is equal to 


Fig. 25.6. Diagram of the spectral 
terms of the sodium atom including 
the fine structure 


Aa)^ = 


' 16 - 


A diagram of the multiplet structure of the principal and sharp 
series of sodium (Z^ 11) is given in Fig. 25.6. 

The diffuse series does not obey such an explicit law for the 
splitting of spectral lines. Each line will be split into three rather 
than four levels (see Fig. 25.7), since in this case in addition to 
the selection rules A/ = zt:l , it is also necessary to take into 


12 

Several other formulas have also been proposed for the doublet splitting For example, 
on the basis of the quasi-classical picture of penetrating orbits, Lande proposed a formula 
replacing in (25 63) n by ~ n — h and Z^ff by Z\ zj, where Z^ is the total charge of 
the ion and Zj is the effective charge of the nucleus in the inner region [see Eq. (25.43)] 
into which the orbit penetrates. For further details we refer the reader to E. Condon and 
G. Shortley, Theory of Atomic Spectra, New York: Cambridge University Press, 1958. 
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account the selection rule for the total angular momentum quantum 
number 


A/=rO. ±1. 

The spectral lines of the fundamental series are also split into 
three components. 

I I n 

— I I' ^ 3/2 

I 

— — * I ^Ps/i 

I 

— ' 2 p ,/2 

Fig. 25.7. Splitting of the diffuse 
series 


The multiplet splitting of the spectral lines of monovalent atoms 
may be explained only by taking into account the spin properties of 
electrons. We have already and repeatedly stated that only the 
half- integral quantum numbers (which characterize the spin) can 
lead to doublet splitting of the terms (as in the Stern-Gerlach 
experiments). 



Chapter 26 


Mendeleyev's Periodic System of Elements 

A. X-RAY SPECTRA OF ATOMS 

X-ray spectra provide important information on the structure 
of the inner shells of atoms and are therefore useful in studying 
the sequence in which the shells are filled by electrons. We recall 
that x-rays are emitted when a beam of fast electrons strikes the 
plate of a cathode-ray tube (see Fig. 26.1). An analysis of the 
emerging x-rays shows that they consist of two different types of 
spectra— a continuous and a line spectrum. The continuous x-ray 
spectrum arises as a result of the deceleration of electrons when 
they strike the target. The continuous spectrum is therefore a 
type of bremsstrahlung. If the deceleration of electrons is equal 
to 6y(tc»<{)), then according to classical electrodynamics, the energy 
radiated by the electrons per unit time is given by the relation 

^ ^ ejw- 

dt 3 * 

A characteristic feature of the continuous spectrum is that it 
is cut off at short wavelengths. The wavelength A mm at which this 
occurs decreases as the potential difference between the cathode 
and the anode increases; the cutoff wavelength 1 mm can be deter- 
mined from the law of conservation of energy 

where is the electron energy before colliding with the 

target, <I> is the potential difference between the anode and the 
cathode, is the energy of the emitted bremsstrahlung photon, 

and is the kinetic energy of the electron after colliding with 

the target. 

Introducing the wavelength X = instead of the frequency co, 

U) 

we find that 


F— Y mov^ 



MENDELEYEVAS PERIODIC SYSTEM 


421 


From the last equation it is evident that X can vary from / = oo (as 
in the case when an electron does not lose any energy during the 

collision with the target to a certain minimum value 


X 


min — 


V* 


(26.1) 


as in the case when an electron loses all its energy in the collision 



Fi{S 5. 26.1. Schematic diagram of an x-ray tube. 
C — cathode, T — target 


The dependence of the radiation intensity on X is plotted in 
Fig. 26.2 for two values of the energy of the primary electrons (25 
and 50 kev). It is clearly seen that an increase in ^ corresponds 
to a decrease of the cutoff wave- 
length X jnjn > accordance with Eq. 

(26.1). This equation was successfully 
used in the determination of a more 
accurate value of Planck^s constant 
h than was given by the Wien and the 
Stefan-Boltzmann laws (see Chapter 
1 ). 

When the energy of the electrons 
incident on the target exceeds a cer- 
tain critical value, determined by the 
target material, a line spectrum, 
which is known as the characteristic 
spectrum, is superposed on the con- 
tinuous spectrum. The line spectrum 
characterizes material of which the 
target is made (or rather, the struc- 
ture of the inner shells of the target material) in the same way as 
the optical spectrum characterizes the structure of the outer shells 
of the atom. For example, in the case of a rhodium target, a line 



Wavelength, X 


Fig. 26.2. Short wavelength lim- 
its of the continuous x-ray 
spectrum. 
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spectrum begins to be superposed on the continuous spectrum at 
an energy of 31.8 kev. 

The properties of the line spectrum are identical for all chem- 
ical compounds of a given element. The characteristic x-ray 
spectrum and the optical spectrum are different in this respect, 
since the latter depends on whether the substance occurs in atomic 
or molecular state (for example, the optical spectra of atomic 
oxygen, the O2 molecule, and the H3O molecule are completely 
different). This can be readily understood since only electrons of 
the outer orbits participate in chemical bonding. 

The spectral lines of the characteristic x-ray spectrum form 
regular sequences or series, just like the optical lines of atoms. 
These series are designated by the capital letters K, L, M, M, • , , ; 
the K series has the shortest wavelength, followed by the L series, 
and so on. 


B. CHARACTERISTIC SPECTRA OF ATOMS AND THE 
STRUCTURE OF THEIR INNER SHELLS 

The mechanism which is responsible for the characteristic 
x-ray spectrum of an element was first explained by Kossel (1914). 
Let us suppose that an electron which is incident on a target re- 
moves an electron from, for example, the K shell of the target 
atom, and thus leaves a vacant site in the /( shell (see Fig. 26.3). 

An electron may jump from the L, 
M, A", . . . shells to this vacant site, 
giving rise to x-ray lines (denoted 
by /<«> The character- 

istic x-ray spectrum is thus formed 
as a result of transitions of electrons 
from one inner shell to another. Since 
the bonding energy of electrons mov- 
ing in the inner orbits is much 
greater than that of the outer elec- 
trons. Electrons of much greater 
energy (several tens of kev) are re- 
quired for production of the charac- 
teristic x-ray spectrum than for the 
excitation of optical spectra (where 
several tens of ev are sufficient). 

Two methods that can be used in 
constructing a theory of the com- 
plex atom, while taking into account the interaction of atomic 
electrons. In the first method the main potential is taken to be 
the potential of the nucleus when it is completely screened by 
the inner electrons. We used this method in constructing the theory 
of the optical spectra of alkali metals. The potential was determined 



Fip;. 26.3. Schematic diagram of 
the origin of a characteristic 
x-ray spectrum according to Kos- 
sel: •—electrons. The dashed line 
represents the ejection of an 
electron from the K shell 
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by the nuclear charge (Ze^ and the charge of the electrons in the 
inner orbitals [— (Z — 1)^0 ]. The total potential was equal to 

— (Z — 1)) = ^. (26.2) 


As the perturbation potential we selected an additional potential 
which took into account the polarization and the space distribution 
of the electron cloud. This method is particularly suitable for outer 
electrons, as, for example, in atoms of alkali metals. 

In studying the motion of electrons in the inner shells, it is con- 
venient to use the potential of the nucleus 


0 = :^ (26,3) 

as the main term in the expression for the potential, and to regard 
the additional potential produced by the electron shell as a correc- 
tion. In this case, the presence of electron shells will result in a 
screening (an effective decrease) of the nuclear charge Zcq by the 
amount and the total potential will be 

= (26.4) 


For example, in the investigation of helium-like atoms, it was 
shown that the interaction of the electrons in the /( shell reduces 
the effective nuclear charge [see (25.35)], which then becomes 

Z' Z — ^ . Thus the quantity is equal to */i6 case. 

The screening constant 5^ must be a function of both n and I . 
It becomes larger as n increases since there is a greater number 
of electrons screening the nucleus. It also becomes larger (but 
more slowly) as / increases, since the orbits become less and less 
penetrating and the effective charge decreases somewhat on the 
average. In the first approximation it may be assumed that the 
screening constant is independent of /. 

The potential (26.4) gives the same formula for the spectral 
terms as was obtained for a hydrogen-like atom, except that the 
quantity Z is replaced by Z — 


Rfi 


(26.5) 


From (26.5) we can obtain an expression for the frequency of the 
Kr, line 






=/?[ 




1 * 


2 ^ 


]■ 


(26.6) 
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It follows that the frequency of the x-ray spectral lines increases 
monotonically as a function of the atomic number Z. This was 
first deduced by Mosley (1914), from an analysis of empirical data; 
he wrote this relationship in a somewhat different form 

= (26.7) 

This formula can be obtained from (26,6) if it is assumed that the 
screening constant for K and L shells is the same, i.e., = = 

We know, however, that this is not quite correct, and, therefore, in 
studying x-ray spectra, just as for optical spectra, we should 
express the frequencies as differences of spectral terms. In 
accordance with (26.5), the spectral terms may be represented in 
the form 




Z-Sn 
Rtl n 


(26.8) 


This relation is called Moseley’s law; it is usually analyzed 
graphically. Ascribing different values to the principal quantum 

number n, we get (see Fig. 26.4): 



for K terms (n — l) 


V R 1 * 

(26.9a) 

for L terms (n = 2) 


*1 /" la ^ — "^2 . 

(26.9b) 

for M terms (/z = 3) 


r 3 • 

(26.9c) 


Fig. 26.4. Most-'ley dia^^ram 


Investigations of the experimental 
curves have given the 


following average values for the 
screening constants: 5, — 3.5, ^ 3 — 10.5J It has also been 

established the x-ray spectra change monotonically with increas- 
ing Z and that no periodic regularities are observed. This repre- 
sents a further difference between x-ray spectra and optical spectra, 
in which periodicity exists (see Section C). 


^See A, Sommt'rfold, Atombau iind Spociralhnicn i, Vieweg., Braunschweig, 1951, 
Chapters 4 and 5. 
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Thus, periodic properties appear only in valence electrons, and 
not in inner electrons. The study of x-ray spectra made it possible 
to prove definitely that the atomic number Z, introduced by 
Mendeleyev, is determined by the charge of the nucleus, and not by 
its mass. It turned out that Mendeleyev had correctly arranged the 
elements Co—Ni, Ar— K, Te— I in a sequence which was not the 
same as the order in which their atomic weight increased. 

There were also doubts concerning the correct sequence of the 
rare earths (elements from Z = 58to71), whose chemical properties 
are very similar. Moseley’s law made it possible to verify 
their arrangement in the periodic system. In addition, the study 
of x-ray spectra made it possible to determine the filling of 
the inner shells of the ferromagnetic metals and lanthanides , where 
the Moseley curve is discontinuous. 


C. MULTIPLET STRUCTURE OF X-RAY SPECTRA 

X-ray spectral terms are primarily determined by the quantum 
state characterized by quantum numbers tu I and /, from which the 
electron has been removed, leaving a “hole.*’ 

As a rule, LS coupling occurs for outer electrons, whereas jj 
coupling occurs for inner electrons (in sufficiently heavy atoms). 
In the heaviest atoms jj coupling begins to play an important role 
for outer electrons as well. 

Since the inner electrons are relatively close to the nucleus, 
they are mainly under the influence of the field of the nucleus , and 
therefore, their energy states are close to those of a hydrogen-like 
atom. We may, therefore, take the following formula as a starting 
point: 



(26.10) 


where the fine structure constant is 


„ ^0 ^ * 

In order to account for the screening of neighboring electrons, 
we must make the substitution Z->Z — S„/in (26.10). We assume 
in this approximation that the screening constant depends not only 
on Ai, but also on /. Then for the x-ray spectra we obtain 

b'j _ 1 _ /_^ 1 \ 

/? Rh^ /i* ' ly-l-i 1’ 
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Taking the approximate square root we find a generalization of 
Moseley^ s law to the case that includes relativistic and spin 
effects 



This formula shows that in Moseley^ s curves there appears a 
term ^ Z* in addition to the term which is proportional to Z . The 
Influence of this additional term becomes marked only at large 
values of Z. This conclusion is in agreement with experimental 
data. 

In addition, Eq. (26.11) explains the multiplet structure of x-ray 
terms. We note, first of all, that there is no splitting of the K 
term, since only one state (Isi/J is possible (/i=l, / = 0, y = 7 ‘ 2 )- 

For L terms, we have three components: Lir^2si/.^(n = 2, 1 = 0, 
/=Vi). i.ii~2pi/,(rt = 2, 1 = 1, /=Vj) and Lm ~ 2^3/, (n = 2. < = 1. 
/=’/.)• 

To make Eq. (26.11) agree with the experimental data for 
screening constants, we must set 

5,2, = 3, 53^ = 4. 

We obtain the following equations for the corresponding terms: 


|/" terms); 

— ^ (2—4)’ j {Ln terms); (26.12) 

^ (Z - 4)’ 1 (Z.,„ terms) . 


These relations are represented graphically in Fig. 26.5. 

In exactly the same way, it is easily shown that the M terms 
contain five components 

Mi(3si/j), Mu{3pi/^), 

Mu\(3p3/^), M\yj{3cl%/^), MY(3di/J, 

where the screening constants are 5;,, = 8,5, 53^ = 10, 53^ = 13. 
There are seven components for the N terms. 

The parallel doublet Lj and Lu [see (26.12)], which is due to 
the screening of the nucleus by the electrons is known as an irreg- 
ular doublet, whereas the diverging doublet h and Lm is known 
as a regular doublet. The reasons for the adoption of this terminology 
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go back to the first stage of the theory of multiplet splitting of 
x-ray spectra. It may be recalled that a theory of the fine structure 
was first constructed by Sommer feld, starting with a relativistic 
generalization of the Bohr theory. Sommer fold’s formula gave a 
correct value for the splittingof the spectral terms (see Chapter 2), 
but neglected the spin properties. For x-ray spectra this formula is 






R 


f- 


(Z-Snny 


4 ) » 


where = Applying this equa- 

tion to the analysis of the L terms, 
it can be shown that these terms split 
into only two components which corre- 
spond to the diverging doublet U and 
L\\\ in our notation. Thus, only this 
doublet was explained in Sommer- 
feld’s relativistic theory. Accord- 
ingly, it was called the regular 
doublet. The doublet L\ and Ln, for 
which no theoretical explanation was 
given for a long time, was called the 
irregular doublet. With the advent of 
the Dirac theory, the irregular doublet 
also found an explanation. Therefore, 
the reasons for the adoption of this 
terminology are entirely historical (just as for the ‘‘normal” and 
“anomalous” Zeeman effect). The correct theory of the multiplet 
structure of x-ray spectra (and of the anomalous Zeeman effect) 
was developed much later, when the spin properties of electrons 
were taken into account. 



Fig. 26.5. Multiplet structure of 
the x-ray L terms 


D. THE DISCOVERY OF MENDELEYEV’S PERIODIC LAW 

Upon arranging the known elements in order of increasing 
atomic weight, Mendeleyev discovered that various chemical 
properties tend to recur quasi-periodically. For example, the 
chemical properties of sodium, potassium and other alkali metals 
are similar to those of lithium, and the chemical properties of 
chlorine, bromine, iodine and so on, are similar to those of 
fluorine. 

Mendeleyev ascribed to each element a number Z (the atomic 
number) giving its position in the periodic system. Although the 
increase of Z is for the most part parallel to the increase of the 
mass number of the elements, there are several exceptions— for 
instance, (a^CO— ^gNi), (igAr— ^^K), (g^Te— b^I)— where the element 
with the smaller atomic number has a larger atomic weight. 
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Moreover, as we now know, there exists a large number of isotopes, 
that is, atoms having the same Z, but different masses (for ex- 
ample, iH', lD^ iT*). 

The periodic law has acquired particular significance in con- 
nection with recent discoveries concerning atomic and nuclear 
structure. In particular, the study of x-ray spectra and experi- 
ments on the scattering of a particles by atoms have established 
that the atomic number Z characterizes the charge of the nucleus 
(and the number of electrons in the neutral atom). 

Sixty-three elements were known when Mendeleyev discovered 
the periodic law (1869). He predicted the existence of ten more 
elements, and even described the basic chemical and physical 
properties of three elements which were subsequently discovered, 
namely, scandium (.iSc), gallium (^iGa) and germanium (.j^Ge). The 
inert (noble) gases were discovered at the end of the 19th century. 
In Mendeleyevas time, only three elements from the rare earth 
group (lanthanides) were known: cerium, didymium (a mixture of 
praseodymium and neodymium) and erbium. At the present time 
the properties oi all fourteen rare-earth elements have been 
investigated. 

By 1937, ninety-two elements were known, but four of these had 
not yet been observed. It was later found that these four elements 
were radioactive and virtually nonexistent in nature. They were 
produced in the laboratory as a result of nuclear reactions. 

In 1937, E. Segre produced an element with Z = 43, called tech- 
netium, by neutron bombardment of molybdenum. The half-life of 
its most stable isotope, was found to equal 2.6 x 10‘^ years. 

In 1938, it was first reported that an isotope of the last rare- 
earth element with Z = 6l had been produced as a result of deuteron 
bombardment of neodymium. The half-life of the most stable 
isotope of this element, , is about 20 years. 

In 1940, E. Segre discovered an element with Z-=^85 by irradi- 
ating bismuth with a particles. He called this element astatine. The 
half-life of its most stable isotope, is 8.3 hours. 

A short-lived element withZ = tS7, called francium, was dis- 
covered in 1939 by a Frenchwoman, Mile. Perey. This element is 
produced in the a decay of g-jAc^^^ The half-life of its most stable 
isotope, 87Fr'^’*, is equal to 22 minutes. 

And finally, we must mention that with the development of 
nuclear physics, it has become possible to produce transuranium 
elements. These range from neptunium (Z — 93) to lawrencium 
{Zz= 103).2 Thus, the periodic system now consists of 103 elements 
without any intervening gaps. 


2 

The largest number of transuranium elements has been discovered by G Seaborg and 
his students The discovery of lawrencium (Lw) with Z — 103 was recently announced. 
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E. FILLING OF THE ELECTRON SHELLS 

In quantum mechanics, the levels of the electron shells are 
filled in accordance with the following rules. 

(a) There can be at most one electron in each quantum state 
(PaulFs exclusion principle), and therefore the maximum number of 
electrons with a given / is equal to 2(2/+ 1) (see Chapter 25). 
Thus the s, p, d and f subshells can contain no more than 2, 6, 10 
and 14 electrons, respectively. 

(b) Electrons tend to occupy the lowest energy levels. There- 
fore, the shells with/ 2 =: I will be filled first, then those with n = 2, 
;i = 3,and so on. 

The shells would be filled in this way in an ideal scheme in 
which the wave function of an atomic electron could be calculated 
on the basis of the assumption that the charge of the nucleus and 
the charges of the Z — f electrons are all located at the center. In 
this case the energy levels of the remaining electron would be the 
same as in the hydrogen atom, and therefore they would be degen- 
erate with respect to /. As was shown in the investigation of alkali 
metals, however, the distribution of the electrons over a finite 
volume removes the / degeneracy, so that terms with a fixed value 
of the principal quantum number n (that is, the terms in a specific 
shell) are arranged in order of increasing L The s term is there- 
fore filled first, then the p term, and finally, the d term. 

Moreover, the 4s subshell is located below the 3rf subshell (and 
5s is below while the 6s subshell is below both the bd subshell 
and the 4 / subshell (similarly, 7s is below 5 /). It turns out that the 
outer shell (in an unexcited atom) can consist only of s and p sub- 
shells. The d and f subshells can be filled when they lie in the 
first or second inner shell, respectively (the first inner shell is 
taken to be the shell directly adjacent to the outer shell)? We 
shall make an attempt to substantiate this by an investigation of 
the ground-state configuration of the electrons in individual atoms 
(see Fig. 26.6). 


^The order in which the electron subshells are formed can be remembered most simply 
with the help of the following empirical rule: the levels are filled in the order in which the 
sum of the principal and orbital quantum numbers, n \ /, increases, and levels with the 
same value of this sum are filled in order of increasing n Since I takes the values 
0, 1, 2, , n — 1, we can find the rule for filling the terms in any shell For example, the 
fourth period (see below) will be filled in the order 

4s (n t / - 4), 3ci(n 1 / = 5), 4p (n + / 5) 

and the sixth period in the order 

6s(n + / - 6), 4f(n + I -- 7), 


5d(n + / - 7), 6p(n 4-^ = 7). 
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Within the first and second periods of Mendeleyevas system, 
the order in which the levels are filled conforms to the sequence 
of levels in the hydrogen atom (the ideal scheme). If this ideal 
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Fig. 26.6. Diagram of the filling of energy 
levels in the periodic system of elements. 
Only s and p subshells can be present in the 
outer shell of an atom. A d subshell can be 
filled only beginning with the first inner 
shell. An f subshell can be filled only be- 
ginning with the second inner shell. The 
filling of the 3d subshell gives the ferro- 
magnetic elements (Fe, Co, Ni). The fil- 
ling of the 4f subshell gives the lantha- 
nides or rare earths (sgCe — 7]Lu). The 
filling of the 5f subshell gives the actinides 
(g()Th — 103). The asterisks denote the atom- 
ic numbers of elements which have not yet 
been discovered. The maximum number of 
electrons in a given shell or subshell is in- 
dicated in parentheses. Emanation (Em) (the 
element with Z - 86) is also called radon (Rn). 


scheme were applicable to complex atoms, the 3d subshell would 
start to be filled beginning with potassium (Z— 19). According to 
the table given in Chapter 25, however, 5^ = 0,146, and S^ = 2,23, for 
potassium, and therefore the energy of the electrons in the 3d and 
4s states will be 



RU 

(3 — 0 . 146 )» 

■• 2 , 854 “’ 

£ — - — 

Rn 

(4 — 2 . 23 )= ~ 

\jr • 


( 26 . 13 ) 


It can be seen that and hence the 4s level will be filled 

before the 3d level. Consequently, the third period will contain only 
seven elements (,iNa — i^Ar), just like the second period. 
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After the 4s subsheli is filled in Ca (Z = 20)> one might expect 
that the filling of the sub shell would begin with scandium (Z = 21). 
Spectroscopic data show, however, that in elements ^iSc — ^sNi the 
3d subshell is filled first. This subshell becomes filled at the ex- 
pense of electrons from anCu to 30 Zn of the 4s subshell, which then 
must be refilled after all available states in the Zd subshell are oc- 
cupied. Only after that can the refilling of the 4p subshell start. 
Thus the fourth period contains 18 elements and consists of the 4s, 
3d and 4p subshells (see Fig, 26.6). The fifth period repeats the 
fourth period (the 5s, 4d, 5p subshells are filled), and thus it also 
contains 18 elements ( 37 Rb“ 64 Xe). 

The sixth period contains 32 elements ( 8 »Cs — geRn), because 
besides the outer shell, consisting of 6 s and 6p states, the first 
inner sub shell (ten electrons) and the second inner sub shell 4/ (the 
14 electrons of the lanthanide or rare-earth elements) will be filled, 
hi exactly the same way the seventh period should repeat the 
sixth period; that is, it should contain 32 elements (the 7s, 5/, 6 d, 
7p subshells). So far, however, only 17 elements of this period 
have been discovered. The so-called actinides, in which the second 
inner subshell 5/ is filled (goTH— element 103) should have properties 
similar to those of the lanthanides. 

The first period therefore contains two elements, the second and 
third periods eight elements each, the fourth and fifth periods 18 ele- 
ments each, and the sixth and seventh period 32 elements each 
(except that the seventh period is incomplete). The order in which 
the states are filled is illustrated in Fig. 26.6. 


F. APPLICATION OF THE THOMAS-FERMI METHOD TO THE 
THEORY OF THE PERIODIC SYSTEM OF ELEMENTS 


We shall now attempt to treat the ground-state configuration of the elements more 
rigorously. 

In a paper devoted to the statistical theory of the atom (1928), Fermi proposed a 
method, now known as the Thomas-Ferml method (see Chapter 25), to explain the periodic 
system of elements. With this method, he obtained the minimum values of Z for which 
s, p, d and / states are filled in atoms. He obtained these values by starting from the 
following quasi-classical ideas. 

In classical theory the angular momentum of a particle L is related to the momentum 
p by the expression 

L= rxp. 

Consequently, 






where Pn is the component of momentum perpendicular to the position vector r. 

Obviously, the square p'j^ of this component of momentum cannot be greater than the 
square of the maximum momentum, which we shall denote by P. Therefore for a given P 
and r the possible values of the angular momentum L must satisfy the inequality 

D 
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It was shown in Chapter 13 that, in the quasi-classical treatment of the atom, the 
square of the angular momentum must be [see (13.62)] 


r = + (26.15) 

This relation represents a compromise between the Bohr relation / “ = 1i^ (/ -[- 1 )“ and 
the quantum mechanical relation = /i*/ (/ + 1) , ^ 

The maximum momentum P is related to the density of the electron gas (electrons in 
the atom) poby the expression (6.32) 




( 26 . 16 ) 


The electron density po may be found from the Tliom as- Fermi equation (see Chapter 25), 
which, as we have already indicated, can be solved only by approximate or numerical 
methods, A good approximation for po, as follows from a solution of the Thomas-Fermi 
equation, is provided by the expression [see (25.21)] 




9 


(26.17) 


where the coefficient \ is found by the Ritz variational method. 

Substituting these values for P^and /.* into the inequality (26.14), we get 


-I V)<r ^ + 

\ 16 j r ^ ^ 


(26.18) 


Introducing the new variable Ir = we find 


where 



(26.19) 



(26.20) 


From the inequality (26.19) it is evident that the right-hand side of (26.19) becomes 
greater than the left-hand side as a* — 0 (r — 0) and as x oo. The electrons in the atom 
therefore can have a given value of / if a: lies in the range a'i < a < .v^ for which the 
Inequality (26. 19) is satisifled. Here a*i and A'a are roots of the equation 



(26.21) 


The condition for the appearance of states with a given value of I is the equality of 
both roots 


Ai = As 

In this case we should equate not only the two functions themselves, but also their de- 
rivatives, Then, in addition to Eq, (26,21), we will have 
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These two relations will be satisfied for 


Vx=3. 


that is, when 


D = 9e-^. 


Substituting the value for D from (26.20), we find the value of Z at which electrons with a 
given / will first appear 


Z = — (21 -h 1)> = t (2/ + \)\ (26.23) 

where a = 2.718 ... is the base of the natural logarithms and the coefficient 7 is equal to 
0.158. 

A numerical solution of the Thomas-Fermi equation gives a very similar value for 7 

7 T-F ~ 0.155. 

This again is a convincing demonstration that the density (26,17) represents a good 
approximation to the density which is given by a numerical solution of the Thomas- 
Fermi equation. 

Equation (26,23) enables us to calculate the Z values at which the s, p, cf, and / states 
begin to be filled. ITie results of this calculation are given in Table 26.1. ITie first row 
gives fractional values of Z computed from formula (26,23) with 7 
values of Z, calculated with 7 q^_p = 0.158, for which s,p, d, and /states first appear 
are practically identical with those calculated with 7 0,155. The second row gives 

the nearest greater integral value of Z. The last row of the table gives the empirical 
values of Z at which the states first appear, and also the symbol of the corresponding 
elements. 


Table 26.1 

Atomic numbers at which the s, p, d, and / states first appear for a given / 


/ 

s 

0 


d 

2 

/ 

3 

Theoretical value 

7 (0.15 

4.2 

19.4 

53.2 

(Thomas-Fermi) 


5 

20 

54 

Empirical value 

Z 1 (H) 

5(B) 

21 (Sc) 

58 (Ce) 


From this table it can be seen that this approximate theory is in good agreement 
with the experimental data. We may note that complete agreement is obtained if the co- 
efficient 7 is taken to be 0. 169 instead of 0. 155, 

It is well known that in light elements (Z= I, 2, 3, 4) only s states are filled. The 
filling of the p states begins with boron (Z == 5); this is in complete agreement with 
theoretical data. Table 26. 1 shows (in spite of the crudeness of the statistical model) 
that the filling of the Sd subshell does not begin, as might be expected, in potassium 
(Z = 19), but in scandium (Z — 21); that is to say, it does not begin until the 4s subshell 
is completely filled. Similarly, the Thomas-Fermi model explains the ''delay" In the 
filling of the 4/ subsheil, which might be expected to begin in Ag (Z = 47). According to 
the theory, however, the filling of the 4/ subshell should be shifted, and should begin only 
in cerium (Z = 58), It follows from (26.3) that the filling of the 5g subshell (/ — 4) would 
begin in the element with Z = 1 24 

llie Thomas-Fermi model accounts for a very important feature of the ground-state 
configuration of atoms and explains the departure of the filling of the levels from the 
ideal scheme (the hydrogen scheme) in terms of the "smearing" of the electron cloud. 
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G. PERIODICITY IN THE PROPERTIES 
OF ELEMENTS 

The periodicity in the properties of the elements can be ex- 
plained quite naturally in quantum mechanics. It is connected with 
the periodic nature of the filling of the outer shell, which contains 
at most eight electrons {s and p states) and which determines the 
chemical and optical properties of atoms. All elements, therefore, 
can be divided into eight groups, depending on the number of elec- 
trons in the outer shell. 

The elements in Group I (hydrogen and alkali metals) have an 
outer shell containing a single electron. As a result, the optical 
terms (except the s term) are doublets and the elements are 
monovalent, as will be shown below. The elements of Group II— the 
alkaline earth metals (beryllium, magnesium, calcium, etc.)--have 
two valence electrons; their spectral terms must therefore be 
singlets and triplets, and the valence is equal to two. The elements 
of Group III have an outer shell containing three electrons, and 
therefore the maximum splitting of their optical terms must be 
equal to four (quartets); their maximum valence is three. 

On the contrary, the elements of Group VII— the halogens 
(fluorine, chlorine, etc.)— lack just one electron to fill the outer 
shell completely. Therefore, in addition to the maximum (positive) 
valence of seven, they may have a negative valence of — 1 (the 
number of electrons required to obtain a stable configuration). 
They exhibit this valence in the so-called ionic compounds (see 
Chapter 27). 

Finally, in the inert gas group (neon, argon, krypton, etc.) the 
outer shell is completely filled. We may even say that there is no 
outer shell in these elements, because the energy bonding these 
electrons in the outer shell is larger than that bonding all other 
shells in the molecule and thus it would be more correct to ascribe 
it to the inner shell. Thus, these elements may be assigned to the 
zeroth^’ group. The elements of the zeroth group do not as a 
rule enter into any chemical reactions, and thus these elements 
are said to be chemically inactive. 

There are, however, a number of exceptions to the rule that 
there are eight elements in each period. The first exception is 
constituted by hydrogen (Z= 1) and helium (Z==:2), which form the 
first period. In this period there are only two elements, and not 
eight. This is due to the fact that the K shell does not include p 
states. Consequently, the properties of these elements are of a 
dual nature. Because there is only one electron in the outer shell, 
hydrogen should have the same chemical and optical properties as 
the alkali metals. Indeed, Just like in these elements, the maximum 
splitting of the spectral terms of hydrogen is two, and its valence 
is one. Hydrogen, however, also resembles the halogen group in 
that it lacks just one electron for a complete outer shell, and it can 
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therefore acquire a second electron forming, in the same way as 
the halogens, a negatively charged ion. 

Helium resembles the alkaline earth metals of the second group 
in that it also has two electrons in the outer shell. The spectral 
terms of both helium and the alkaline earth elements are either 
singlets (spin 0), or triplets (spin 1), However, in its chemical 
properties, helium is a typical representative of the inert gases, 
because its outer K shell is completely filled; hence it does not 
participate in normal chemical reactions. 

The maximum valence of elements is determined, as a rule, by 
the number of electrons in the outer shell; that is, the valence of 
atoms varies from unity (for atoms of the first group) to seven 
(halogen group). There are, however, certain exceptions among the 
elements in which the inner shells are filled after the outer shells. 

It can be seen from the periodic table that there will be two 
electrons in the outer shell of all elements from scandium (Z = 2\) 
to nickel (2 = 28), with the exception of chromium, where there is 
only one. Owing to transitions of electrons from M states to 4p 
states, however, the maximum valence of scandium (Z = 2I) is equal 
to three, and that of manganese (Z = 25) is equal to seven. Conse- 
quently, it was necessary to place these elements in groups corre- 
sponding to their maximum valences (this was correctly done by 
Mendeleyev). Mendeleyev placed iron (Z = 26), cobalt (Z = 27) and 
nickel (Z = 28) in a special group (Group VIII). The introduction of 
this group is justified from the point of view of modern quantum 
mechanics, since at most eight electrons can occupy the outer 
shell. In general, however, iron behaves either as a bivalent or 
trivalent element. All of these elements have similar properties. 
In particular, they have distinctive ferromagnetic properties caused 
by uncompensated spins of the 3d electrons in the inner shell. The 
presence of these states is due to the fact that from the energy 
standpoint, the 3d state is more favorable during formation of the 
crystal lattice than the other states in which the spins of the elec- 
trons can be compensated. 

In the elements following the ferromagnetic, the 3d subshell is 
the first to be completely filled; the filling of the levels then con- 
tinues in the 45 and then the 4p subshells. Krypton completes the 
structure of the M shell (n = 4) ; therefore its optical and chemical 
properties will be characteristic of the inert gases. 

As we have already mentioned, the fifth period, which extends 
from the alkali metal rubidium (Z = 37) to the inert gas xenon 
(Z = 54), is a repetition of the fourth period and exhibits no new 
features. 
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Quantum theory also explains the characteristic properties of 
the elements in the lanthanide series (the rare-earth elements), 
which comes immediately after lanthanum and extends from cerium 
(Z==58) to lutetium (Z = 71). The elements in this series are 
formed by consecutive addition of electrons to the deeper 4/ sub- 
shell (the second inner N shell), even though the first inner shell 
(0) and the outer shell (P) are still incompletely filled. Since the 
chemical properties are determined mainly by the electrons of the 
outer shells, all 14 rare-earth elements are much closer with re- 
gard to chemical properties than are the elements in which the 
first inner d subshell is filled. 

For a long time hafnium (Z — 72) was also Included in the 
lanthanide series. A theoretical analysis performed by Bohr 
showed, however, that there can be at most fourteen elements in 
this group (the possible number of /-states) and that, therefore, 
hafnium must be a chemical analog of zirconium. Careful exper- 
iments have confirmed this theoretical conclusion. 

The actinide series in the seventh and last period is analogous 
to the lanthanide group. Begiining with thorium (Z — 90) , the ele- 
ments of this series are formed by the consecutive addition of 
electrons to the deep- lying 5/ subshell of the 0 shell, while the 6^, 
6/; and 7s subshells remain completely filled and the Qd subshell is 
partially filled. The actinides include protactinium (Z = 91), uranium 
(Z = 92) and also the following artificially produced transuranium 
elements : neptunium (Z = 93), plutonium (Z = 94), americium (Z = 95), 
curium (Z = 97), californium (Z = 98),. einsteinium {Z=: 99), fermium 
(Z= 100), mendclevium(Z=101), nobelium (Z=:102) and the recently 
discovered element 103 (lawrencium)f 

The question of how many elements can be produced by artificial 
means and experimentally detected, and the question of where the 
periodic system ends, have not yet been finally answered. It is 
clear, however, that the periodic system ends because of he in- 
stability of nuclei (due mainly to their spontaneous fission). 


^The discovery and properties of the new elements are discussed in G. T. Seaborg, 
The Transuranium Elements, Reading, Mass., Addison-Wesley, 1958 
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The Theory of Simple Molecules 

A. BASIC FORMS OF THE CHEMICAL BOND 

The chemical properties and the optical spectrum of an 
element are determined mainly by the outer electrons of the 
atom. Therefore, the regularities in the structure of the outer 
shelly which account for the optical periodicities, also provide 
a basis for the construction of a theory of the periodically re- 
current chemical properties of the elements. It should be noted 
that the chemical properties, unlike the optical properties, are 
not exhibited by Isolated atoms, but appear only in the presence 
of other atoms, with which the atom forms chemical compounds. 

The inner electrons have almost no influence on chemical 
processes, since they are much more strongly bound to the 
nucleus than the outer electrons. Chemical reactions therefore 
liberate much less bonding than the energy of the inner elec- 
trons. 

In discussing the chemical properties of an atom, we must 
distinguish between two main types of chemical bonds: ionic (or 
heteropolar) and atomic (or homopolar). We shall consider both 
of these types in greater detail. 


B. HETEROPOLAR MOLECULES 

Inorganic salts consist of positive and negative ions held to- 
gether by an electrostatic (Coulomb) attraction to form a mole- 
cule. Compounds of this type are called ionic, and their molecules 
are said to be heteropolar. The ions may be either positive or 
negative. The sign of the charge on the ion depends, on the one 
hand, upon the ionization potential, that is, the energy that must 
be expended in order to remove an electron from the outer 
shell; and, on the other hand, on the electron affinity, that is, 
the energy which the atom must acquire to hold an additional 
electron in the outer shell. 

Let us assume that a neutral atom with atomic number Z 
contains M electrons in the inner orbitals and Z^ — Z — N elec- 
trons in the outer orbitals. The electrons in the inner orbitals 
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will completely screen a corresponding fraction of the nuclear 
charge, but will do so only in the region outside this inner shell 
(starting at the outer shell). Thus, the Coulomb potential energy 
holding the electrons in the outer shell is 



However, inside the atomic core the charge will be Z,>Z^; that 
is, the screening of the nuclear charge will not be complete (see 
Chapter 25). In exactly the same way, the outer electrons will 
completely screen the remaining part of the nuclear charge (Z^^eo ) 
only in the region outside the outer shell (that is, only in the case 
of excited states). While in this case there appear polarization 
forces proportional to they are not able to hold an additional 
electron. In the outer shell Itself the charge will be incompletely 
compensated; for this reason (but provided there are unfilled 
states in the outer shell), the incompletely screened part of the 
nuclear charge will hold additional electrons in this shell, thus 
forming a negative ion. The rule is that the less electrons there 
are in the outer shell of a neutral atom, the larger the total 
screening of the nuclear charge in this shell. Therefore, an alkali 
metal will lose the one electron in its outer shell more readily 
than it will acquire additional electrons, 

A curve showing the dependence of the ionization potential on 
Z is plotted in Fig. 27,1. It shows a minimum for alkali metals and 
a maximum for inert gases. This curve reproduces rather faith- 
fully the periodicity exhibited by the number of electrons in the 
outer shell. 

In inert gases the ionization potential reaches its largest 
value; the removal of an electron from the outer shell and its 
transfer to another atom require a very large expenditure of 
energy. In addition, no further electrons can be held in the outer 
shell, which is completely filled. Therefore, inert gases do not 
participate in ordinary heteropolar compounds (we shall also see 
that they do not form homopolar bonds), and hence, as a rule, 
they exist as unassociated atoms. 

Atoms of alkali and alkaline earth metals readily give up their 
valence electrons to another atom (the ionization potential is at 


^For example, in sodium (Z 11), the ten electrons in the inner shell completely 
screen ten units of nuclear charge, which leaves only the eleventh unit of nuclear charge 
to be (partially) screened by the outer electron. In chlorine (Z ^ 17) the ten inner elec- 
trons completely screen only ten units of nuclear charge, so that the seven electrons of 
the outer shell must screen the remaining charge, which they can accomplish only partially. 
Therefore, a chlorine atom is able to hold an additional electron more easily than sodium, 
and IS thus converted into the negative ion Cl . On the other hand, a sodium atom gives 
up its outer electron more readily, and in this way forms a positive ion Na^. 
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its minimum here), and thus convert to positive ions (for example, 
a Na”*’ ion). 



Fig. 27.1, The dependence of the ionization po- 
tential of a neutral atom on the atomic number 

On the contrary, atoms in Group VI (including oxygen) and in 
Group VII (halogens), and also hydrogen (which resembles Group 
VII with regard to the number of missing electrons), have a 
higher electron affinity than the other elements (see Table 27.1). 
The electron affinity of sodium is practically equal to zero, like 
that of inert gases. 


Table 27.1 


Electron Affinity 


Element 

Electron affinity 
(ev) 

H 

0.71 

F 

4.13 

Cl 

3.72 

O 

3.07 


The first successful attempt to construct a theory of the ionic 
bond was due to Kossel (1916), who made use of the Bohr theory 
of the atom. 

KosseEs theory was based on the fact that the eight- electron 
shells of the inert gases are closed, so that these atoms have 
zero valence. Positive valence (or valence with respect to hydro- 
gen) is determined by the number of electrons in the outer shell 
in atoms in which these electrons are readily lost (atoms of 
Groups I or II). Negative valence (valence with respect to fluorine 
or twice the valence with respect to oxygen) is determined by the 
number of electrons which the atom can acquire, that is, the 
number of vacant states in the outer shell (see Chapter 26). 
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Negative valence is particularly pronounced In elements of Groups 
VI and VII, although both types of valence may be exhibited by a 
given element. For example, in the typical heteropolar compound 


Na Cl 

Fir. 27.2. Two neutral independent atoms 
(Na and Cl). The black dots indicate 
electrons, the light dot indicates a state 
which can be occupied by an electron 
owing to the electron affinity of the atom. 

HCl, chlorine has a valence of -1, although other compounds are 
possible in which chlorine has a valence of +7. An example of 
the latter is CI 3 O 7 . We do not intend here to develop a complete 
theory of the chemical bond, and shall restrict ourselves to an 
examination of one typical ionic molecule, namely, the hetero- 
polar molecule NaCl.^ The energy bonding an outer electron in 
atomic sodium is 5.1 ev.^ When the valence electron of sodium is 
transferred to the outer shell of the chlorine [that is, durir^ 
formation of Na"*" and Cl~ ions (see Fig, 27.2)], it carries some- 
what less bonding since its affinity for chlorine {Wqi = 3.7 ev) 
is somewhat lower than the ionization energy of sodium ( U?" 

5,1 ev). However, in the formation of the molecule, this deficiency 
is compensated by the Coulomb energy of attraction between the 
Na^ and Cl" ions (see Fig. 27.3). 

The total energy bonding these atoms in the NaCl molecule is 
given by the expression 

This energy has been very carefully determined from experiment 
and found to be equal to: II/njci — 4,2 ev. Hence for the Coulomb 

2 

See G Herzberg, Atomic Spectra and Atomic Structure, New York, 1944 
^The bonding energy of an electron in an atom (or molecule) is equal to the energy that 
must be expended in order to remove an electron It is, therefore, equal to the negative of 
the energy holding the electron in the compound (W - — V), that is, it will be a positive 
quantity 
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bonding energy between the ions we find 

rCoul=--~ ^^ion u^bond^ 3 ^ 

Since ~ we obtain a perfectly reasonable value 

for the interatomic distance in the NaCl molecule: /?== 2.5 • 10’'® 
cm. 



Fig. 27 3 The formation of an NaCl molecule 
from Na^ and Cl"" 10 ns The ionization poten- 
tial of sodium (5 1 ev) and the affinity of the 
chlorine atom for an electron (3 7 ev) are indi- 
cated in parentheses The Coulomb bonding 
energy between the ions in the molecule is 
5 6 ev. 

It should be noted that we have not considered here all the in- 
teractions that occur in a heteropolar molecule. In addition to the 
Coulomb forces of attraction, there will also be repulsive forces; 
these exceed the Coulomb forces at small distances and prevent 
the two atoms from approaching closer than the distance R, In 
any case, this elementary discussion explains the principal physical 
processes involved in the formation of heteropolar molecules; it 
also explains, however qualitatively, the ionic structure of their 
crystal lattice and the dissociation of these molecules into indi- 
vidual ions, a process which occurs in solutions. 


C. THE MOLECULAR HYDROGEN ION 

Aside from ionic compounds, there exist molecules formed 
directly from the neutral atoms, rather than from ions. The simplest 
representatives of these molecules are Ha, O 2 and Na. These are 
called homopolar molecules. 

The formation of homopolar molecules cannot be explained 
on the basis of classical theory or Bohr's semiclassical theory. 
These theories are useful only for compounds held together by 
electrostatic forces such as, for example, ionic compounds. 
Before discussing the formation of homopolar molecules, let us 
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consider a very simple case, namely, the molecular hydrogen ion 
Ha , which consists of two hydrogen nuclei and a single electron. 
This analysis is important for methodological reasons, because it 
enables us to express in comparatively simple mathematical form 
the features of the bond that arises between two hydrogen nuclei 
owing to the exchange of an electron (exchange forces). The same 
forces also appear in the homopolar hydrogen molecule H 3 . 

Let us denote the distance between the two hydrogen nuclei a 
and a' by R and assume that R changes adiabatically; that is, R 
changes so slowly that it can be regarded as a constant in solving 
the Schrddinger equation. In a more exact treatment, it is neces- 
sary to take into account the vibrations of the nuclei about the 
equilibrium position (the vibrational spectrum), and the rotation 
of the nuclei about the center of mass (the rotational spectrum). 
These questions have been treated in detail in Chapter 12. 

Suppose r and r' represent the distances between the electron 
and the nuclei a and a ! , respectively. The Schrbdinger equation 
for the ionized hydrogen molecule can then be written as 

(J? — H)tf==0. (27.1) 

Here the Hamiltonian is 

H = T--^- - , (27.2) 

and the kinetic energy operator has the form 


T = 


\_ 

2mo 




(27,3) 


where = = vi = -^) since 

r'=r—R, 


(27.4) 


and R may be regarded as a constant in the problem under con- 
sideration. 

We shall restrict ourselves to an investigation of the ground 
state, and carry out our calculations with the help of perturbation 
theory. The spin effects can be neglected in the case of a single 
electron. 

In the zeroth approximation we assume that the electron is 
under the influence of either nucleus a or nucleus a' (see Fig, 27.4). 
The Schrddinger equations describing these two possible unper- 
turbed states are 


(£„-T+ = 

(£a'-T-t-3-)l.a' = 0. 


(27.5) 
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Both eigenvalues and eigenfunctions are identical and correspond 
to the Is state of the hydrogen atom. Since one of the wave 
functions is associated with nucleus a, and the other with nucleus 
a\ we may write / = m = 0) 


Ea — Ea> = Ei = — Rh, 


(27.6) 


where tnfm == +100 is the wave function of 
the ground state of the hydrogen atom 
[see (13.32)], namely, 


4^100 (^) — 


«o. 


(27.7) 


The total energy of the system and the 
zero-order eigenfunction are as follows: 


= — Rh^ 


(27,8) 


The uncertainty in is due to the fact 
that the presence of two nuclei leads to a 
degenerate state of the system. In solving 
(27.1)by perturbation theory, we must set 


/ 



Electron 1 is near 
nucleus a 



Q R a‘ 

Electron 1 is near 
nucleus a' 


Fig. 27.4. Diagram of the inter- 
action of particles in the H 2 
molecular ion The solid arrows 
represent the interaction of the 
particles m the zeroth approxi- 
mation The arrows depicted by 
a dashed line represent the in- 
teraction corresponding to the 
perturbation 


£ = 


(27.9) 


Substituting (27.9) into (27.1) and restricting ourselves to the first- 
order quantities, we find 




(27.10) 


pS 

For the solution (the electron near nucleus a), represents the 
main interaction, and - the perturbation. On the contrary, for 

^ p2 

the solution (the electron near a'), the main interaction is--% 
and the perturbation is - y . 

From Eq. (27.10) we can find the additional energy £', and 
also a relationship between the coefficients and in the wave 
function This can be done because the perturbation energy 
removes the degeneracy (just as in the case of the helium atom). 
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To solve this problem, we can make use of the theorem which 
states that the solution of the homogeneous equation [that is, one 
of the solutions of Eqs, (27.5)] must be orthogonal to the right-hand 
side of the same equation, that is, Eq. (27.10). Assuming that the 
electron is near the nucleus a, we can neglect the perturbation 

energy p on the left-hand side of the equation, since it gives us 

a second-order term when multiplied by Then the solution of 
the homogeneous equation will be the function and according to 
the theorem we have just stated, it must be orthogonal to the right- 
hand side of Eq. (27.10) 


c. J’ (f + ^ (f • 




(27.11) 


Here we have used the fact that the wave function of the ground 
state of the hydrogen atom is real. 

In exactly the same way, assuming that the electron is near the 
nucleus a', we may neglect the perturbation energy on the left- 

hand side of Eq. (27.10) (in this case the perturbation energy is p - ). 

Since the solution of the homogeneous equation will now be the 
function we obtain a second equation for the unknown quantities 


-f C, j (f +-f %.d'K = 0 (27.12) 


When r is replaced by becomes whereas in the reverse 
substitution becomes 6,,. In both cases, the volume element 
(Px remains unchanged {d'x~d^x') . Therefore, we can reduce 
(27.12) to a form that is in agreement with (27.11), but with the 
coefficients C\ and interchanged 


-I - C, J ( F -I - f ) %.d'x = 0. (27.13) 

In further transforming Eq. (27.11), we must not forget that the 
function is normalized to unity 

(27.14) 

but is not orthogonal to the function 

^ <l'a'i'a-«Pjf = S(/?). 


(27.15) 



THE THEORY OF SIMPLE MOLECULES 


445 


This follows from the fact that, although the expression S(/?) 
vanishes for /? -> oo (there will be no points where the two functions 
and both differ from zero), it becomes equal to unity for 
R = 0 

Let us introduce the notation 


and 



A = (27.16) 


where K Is the Coulomb energy of interaction between the hydrogen 
atom and the (atomic) hydrogen ion, and /I is the exchai^e energy, 
which has no classical analog. This energy arises because the 
electron can be in both states and <(<o, simultaneously (that is, 
exchange occurs between states a and o' ). Formally, the existence 
of the exchange energy Is reflected in the fact that the expression 
for ip contains to * as well as As we shall see in what follows, 
it is this exchange energy which gives rise to an attractive force 
between the nuclei. At certain values of the internuclear separation, 
this force exceeds the force of repulsion, and as a result the Ht 
molecular hydrogen ion is formed. It is worth notli^ that this 
mixed state cannot arise in the Bohr theory, and therefore the 
existence of an ionized hydrogen molecule can be explained only 
in terms of quantum mechanics. 

With the help of (27.14)-(27.16), we can reduce Eqs. (27.11) 
and (27.13) to the form 


C,(F-/() + C,(E'5-A)=0, 
C,(£'S-A) + C,(£'-A:) = 0. 


(27.17) 


obtaining two solutions: a symmetric one (Ci==C.j) and an antisym- 
metric one (Ci = — Cj). The symmetric solution is 


(|>S: 
£'S = 


= l/S ()?) = /( 


(ta +'}'«'). 

I A-SK 
1+S ' 


(27.18) 


and the antisymmetric solution is 


Y 2(1 —sj 


=V‘ (R) = K 


A — SK 
)— S • 


(27.19) 
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The factor is the normalization coefficient. We may note 

here that the symmetric and antisymmetric solutions are alreacfy 
mutually orthogonal 


s 




The quantity may be regarded as an additional interaction V (/?), 
which binds the H atom and the ion into a stable hydrogen 
molecule. In order to find the specific form of this interaction as 
a function of /?, we must calculate the values of S, K and A. For 

this purpose we use the well-known expansion of the function —y- 

into a triple Fourier integral 


_ I C 
r J 


(27.20) 


After differentiating this expression with respect to the parameter 
k\y we can easily write the wave function for the ground state of the 
hydrogen atom in the form of a Fourier integral 


r 



.1 hr 

- d'^h 

{k^ + kiY “ 


(27.21) 


where . 

In accordance with (27.15), the expression for 5 becomes 

s = J ir) Ij-.oo {r - /?) d^x (27.22) 

We may replace and tl>ioo(r — R) by their expressions in 

terms of the Fourier integral (27.21). Integrating with respect to 
volume, we obtain 


^ e «*'+*) r (Px = Sit'S (ft + ft'), 

and thus we find the following value for S: 



5 






_ R 

e , 


(27.23) 


(27.24) 
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The integral in this equation can be calculated with the aid of Eq. 
(27*20), which should be differentiated three times with respect to 
the parameter /?; . 

In order to calculate the additional Coulomb interaction, we 
shall use the relation 

= = 8“3 j 


where we must set =—• Then, putting Ajo— 0 in (27.20), we obtain 

In exactly the same way, we may represent the quantity \!R in the 
form of the integral (27.25). Substituting these quantities into 
formula (27.16) and integrating over the volume d^x with aid of 
(27.23), we find 




2 

Hence, using (27.20) and setting we obtain an expression 

for the additional Coulomb energy 

K = (27.27) 

In calculating the exchange interaction 

^ J S - e3 J <1*1.0 (r - R) cPx (27.28) 

we may use (27.21) for — R) and (27.20) for 


tiooW 

r 


kV^ 


J hr 


J /t* f Ag 




where ^o = ". Hence 
“0 


■S o f 


(ft* + ft;)* ' 

K 


(27.29) 
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Substituting the values found for S, K and A into the expression 
for the interaction energy [see (27.18) and (27.19)], we obtain the 
following expression 


V 



!+(•+£+ y 


(27.30) 


Here the upper signs (-) refer to the antisymmetric solution 
while the lower signs (+) refer to the symmetric solution. For 

small values of R (R aj, we have 


!/•» =^when/? 'a„, (27.31) 


and thus the energy, as we would expect, 
is determined by the Coulomb energy of 
repulsion between the two nuclei. For 
large distances (R^aJ we have 


(27.32) 


that is, the antisymmetric solution (+) 
gives a repulsion, whereas the sym- 
metric solution (-) gives an attraction. 

The general nature of the varia- 
tion of and as a function of R is 
depicted in Fig. 27.5 which also shows 
the experimental data on the inter- 
action energy. It can be seen from 
this figure that only the symmetric 
state is realized in practice. 

Theoretical values obtained from the graph give the equilibrium 
distance as 2.50^0= 1.32 A, and therefore the ionization 

energy is 

D-- -F(/?,) = -f 0,0640^ = 1.76 ev. (27.33) 

</() ' 

The corresponding experimental values are 

R^^P =. 1,06 A, 79 ev (27.34) 

(the zero-point energy of oscillations is not included in the 
theoretical and experimental values given here). 



R 

Fip. 27.5. Curves of the interaction 
energy in the molecular hydrogen 
ion as a function of the distance H 
between the nuclei (in units of Oq) 
for the symmetric (V**) and anti- 
symmetric (V**) states 
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The discrepancy between the theoretical and the experimental 
data is due to the fact that here, just as for the helium atom, the 
perturbation energy is commensurable with the energy of the 
zeroth approximation. Solving this problem by the variational 
method, using a test function of the form 


( ICtfJ j 


" rto 


(27.35) 


where Z', the effective charge of the nucleus, is taken as the 
variational parameter, we can obtain values for /?« and D which are 
in considerably better agreement with experiment 


/?var^l 06A, Dvar==2.25 ev. 


If several parameters are introduced, the variational method gives 
results that are in practically complete agreement with the 
experimental data. 

It can be seen that the formation of the molecular hydrogen 
ion is essentially due to the quantum- mechanical exchange forces, 
which in the symmetric state give rise to a stable molecule. 
From the physical point of view, this can be explained as follows. 
The probability of the electron being in the symmetric state is 

pS + (27.36) 

whereas the probability for the antisymmetric state is 

p3 = y = -27]4rs)- (t« + - 2Wa.). (27.37) 

If we plot the curves of constant probability density of the electron 
(see Fig. 27.6), we see that the electron tends to be located at the 
midpoint of the line joining the two nuclei in the case of the sym- 
metric solution, whereas in the case of the antisymmetric solution 
the position probability vanishes at this point. Since the electron 
binds the two nuclei most strongly when it is halfway between them, 
it is natural to expect that the first solution, and not the second, 
will lead to the formation of a molecule. Moreover, in the case of 
the symmetric solution, the curves showing the electron distribution 
about the nuclei tend to merge when the nuclei approach one another; 
this provides a graphical characterization of the homopolar bond."* 


"^For more details see P. Gombas, Theorie und hosungsmethoden des Mehrteilchen^ 
problems der Wellenmechantk, Basel Birkhauser-Verlag, 1950. 



450 


RELATIVISTIC QUANTUM MECHANICS 


Two hydrogen nuclei, or a hydrogen and a deuterium nucleus, can 
also be linked by other particles besides electrons. We may 
mention in this connection the /i-mesic molecule (HD)', in which the 
bonding between the hydrogen and deuterium nuclei is brought 
about by a negative u meson. Alvarez produced a mesic molecule 
of this type by passing negative ^ mesons through a bubble chamber. 
The radius of such a molecule, as calculated from the equation 

/? 1 = 2.5 will be 1/200 the radius of the molecular hydrogen 

///(, Cjj 




Fig. 27.6. Electron density distribution in the molecular 
hydrogen ion. 


ion, since the mass of the u meson is approximately 1/200 as 
large as the electron mass. Thus, when the nucleus of the hydro- 
gen atom approaches the nucleus of the deuterium atom, they 
form a common nucleus, namely that of the >IIe‘ molecule 

As a result, an energy of 5.4 Mev is released and carried away by 
the /i meson. Thus the n meson acts almost as a catalyst of the 
nuclear reaction. 


D. HOMOPOLAR ATOMIC MOLECULES 

The first successful attempt to give a theoretical explanation of 
the homopolar molecule was made by Heitler and London (1927) 
with the help of quantum mechanics. In a homopolar molecule, 
exchange forces play a fundamental role. In their treatment, 
Heitler and London used perturbation theory, which does not give 
completely accurate quantitative results. Although more accurate 
quantitative results can be obtained by means of the variational 
method, the Heitler-London theory enables us to bring out in a 
very simple way the physical features of the homopolar bond. 

Let ri and denote the position vectors of the first and second 
electrons relative to nucleus a, and rj and ri the position vectors 
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of the electrons relative to nucleus a' (see Fig. 27.2). Then 
— /?, r\ = r<t — R. In the zeroth approximation we obtain two 
wave functions which are products of the ground- state hydrogen 
wave functions 


(n) (r^) = (n) '^loo — /?), 

(^l) (^2) 4>ioo (^1 — R) ^^100 2)' 


(27.38) 


The first solution corresponds to the case when the first electron 
is near the nucleus a (and the second electron near the nucleus a'), 
while the second solution corresponds to the case when the 
first electron is near the nucleus a\ and the second electron is 
near the nucleus a. Both these possibilities are depicted in Fig. 
27.7, where the solid arrows show the atomic bonds, and the 
dashed lines show the molecular bonds. When the distance between 
the nuclei tends to infinity (i?->cx5), all molecular bonds vanish. 


/ f}2 



Fig. 27,7. Diagram of the interactions 
in the H 2 molecule The solid lines 
join the particles whose interaction is 
taken into account in the zeroth ap- 
proximation The dashed lines denote 
interactions which are regarded as 
perturbations, a and a' are the nuclei 
of the hydrogen atoms, 1 and 2 are 
electrons 


Just as in the problem of the molecular hydrogen ion, the main 
solutions (27,38) that give rise to an additional degeneracy of the 
system will not be orthogonal* 

VlOO (r") — R) d'^x ~ 5^, (27,39) 


bar over a symbol denotes quantities referring to the neutral molecule. 



452 


RELATIVISTIC QUANTUM MECHANICS 


where S for the ion is given by the expression (27.34). The 
Coulomb energy of interaction of the two atoms is given by 







na 

r[ 

rj 




(27.40) 


where the first and second parenthetic terms in the integrand cor- 
respond to the potential energy of repulsion between the two nuclei 
and the two electrons, and the third and fourth terms correspond to 
the potential energy of attraction between the first electron and the 
nucleus a' and between the second electron and the nucleus a. 

In exactly the same way, we obtain the exchange energy 


A 5 'I- (27.41) 


The expressions for K and A can be computed aproximately by 
the same method as in the theory of the Hi ion. For K we obtain 
a comparativ^y simple result, and for A a more complicated 
result, since A is expressed in terms of an integral logarithm (as 
shown by Sigura). The general character of the solution, however, 
remains the same as in the theory of the II] ion. In particular, 
the main forces which hold the two neutral atoms in the molecule 
are the exchange forces. These forces have a minus sign at 
comparatively large interatomic distances and correspond to the 
mutual attraction of the atoms. Just as in the case of the molecular 
hydrogen ion, we have two solutions. The first solution is sym- 
metric 




V 2(1 l-S) 


(Vaa' Yo'a)> 


' 1 -f s 


and the second is antisymmetric 




-.f 

y2(i- s) 




' 1 — .S' ' 


(27.42) 


(27.43) 


The general form of the curves of for a neutral hydrogen mole- 

cule is approximately the same as for the molecular ion; there- 
fore, only symmetric solutions will give stable molecules. For the 
radius corresponding to the equilibrium position [that is, the 
minimum of the potential ener^ (R) for the symmetric solution], 
we obtain l.Slao™ 0.80 A. The corresponding value for the 
dissociation energy is 


D = ~ V® («o) = 0 . 1 15 = 3.2 ev. 


(27.44) 
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The experimental values of these quantities are 

/?e3cp^0.74A, r=4.73ev. (27.44a) 

We have omitted here the zero-point energy of oscillations, 0.27 ev 
(see Chapter 12), from both the theoretical and the experimental 
values. 


E. SPIN AND THE SYMMETRY OF STATES 

In the ion and the H atom, there is only one electron, and its 
spin leads only to insignificant spin-orbit interactions. On the 
other hand, there are two electrons in the H 2 configuration, and the 
spin plays an important role in the theory of this molecule, even 
though the spin-orbit and the spin-spin interactions give only 
small corrections. In the hydrogen molecule, just as in the helium 
atom, the mutual orientation of the spins of the two electrons 
determines the type of symmetry of the spatial part of the wave 
function; this is of primary significance in connection with the 
stability of a molecule. We shall therefore consider more fully 
the question of the relation of the spin to the symmetry properties 
of the molecule. 

The total wave function ¥ must contain a spin part in addition 
to the spatial part. When the potential energy of the spin-orbit 
interaction can be neglected, then, just as in the case of Russell- 
Saunders coupling, the total wave function can be represented by 
a product of a spatial part and a spin part. For electrons (which 
obey Fermi statistics) the total wave function must change sign 
when coordinates and spins are interchanged (that is, the solution 
must be antisymmetric). We therefore have two possibilities 

= (5„ s,)^s (r„ r,), (27.45) 

= (si, 5 . 2 ) (ri, r,). (27.46) 

It has been shown In Chapter 24 that the antisymmetric spin 
function describes two electrons with antiparallel spins, and 
therefore function which is symmetric in the position coordinates, 
corresponds to a state with total spin 0. In exactly the same way 
the symmetric spin function C^, as well as the antisymmetric 
spatial function, describe a state with total spin 1 (the spins of 
both electrons are parallel). In the case of the hydrogen molecule, 
the only solutions which results in attraction is that corresponding 
to ; thus, a stable molecule is obtained only in the case in which 
the electron spins are antiparallel. 

We shall now proceed to a general analysis of the states of a 
molecule, using the symmetry properties. In this connection, we 
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note that in diatomic molecules the field of force possesses axial 
symmetry with respect to the line passing through the nuclei 
(the symmetry axis of the molecule). The absolute value of the 
component of the total orbital angular momentum along this axis 
of symmetry (which, incidentally, must be conserved) is denoted 
by A, States corresponding to different A are denoted by the 
following letters: (A= 0); II (A== 1); A(A= 2); etc. 

In addition, each electronic state must be characterized by the 
total spin *S of all the electrons in the molecule. For a given value 
of S, [ 1 states are possible. The quantity v, as in the 

case of an atom, determines the multiplicity of the energy level. In 
the case where the total spin is equal to zero {S =^0) , we have 
1. For states with 5=1, the multiplicity v=3, etc. The 
total spin of electrons in a molecule can therefore be characterized 
by the multiplicity v, and the corresponding term can be denoted 
by 

In this notation, the symmetric solution for the spatial part of 
the wave function (that is, the solution ^^js) corresponds to the term 
*2] (that is, A= 0, 5 = 0, v= 1), while the antisymmetric solution 
corresponds to the term (A = 0, 5= 1, v= 3), It is obvious 
that the term corresponds to three states: in two of the states 
the spin is directed along the symmetry axis of the molecule (in 
a parallel or antiparallel direction), while in the third state it is 
perpendicular to the symmetry axis. 

It should be noted that symmetry plays a very important role 
in the theory of molecules (particularly in the case of complex 
molecules). If, for example, we reflect the wave function in a 
plane passing through the symmetry axis of the molecule (which 
we take as the z axis), the energy of the molecule must remain 
unaltered.^ At the same time, if the component of the orbital 
angular momentum or of the spin along the symmetry axis differs 
from zero (A fO or 5,7^0), the rotations which are associated 
with these angular momenta will be reversed as a result of this 
reflection.^ 

For simplicity, we restrict our treatment to the states in 
which the orbital angular momentum is zero, that is, A=0 (v 
terms). In the case where the total spin of the electrons also 
vanishes, that is, 5=0, no change of states will occur in the 
mirror reflection. 


^In the case of reflection in the kz plane, mirror reflection amounts to a replacement of 
V by — V 

7 

As IS well known, the angular momentum L - r xp is an axial vector whose direction 
IS a matter of convention (it has one direction in a right-handed system of coordinates, 
and the opposite direction in a left-handed svstem) The direction of the contour bounding 
the area and constructed from the vectors r and p remains, however, unaltered in both the 
right- and left-handed coordinate systems 
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If, however, the spins of both electrons are parallel (5 = 1), 
the following cases are possible. 

(a) The component of spin along the symmetry axis is equal to 
zero (S^=0). In this case, the rotation characterizing the spin 
remains unaltered as a result of mirror reflection (see Fig. 27,8, 
where the initial and reflected spins are characterized by a 
rotation denoted by II and IT). The corresponding terms are 
designated by the symbol 

(b) The component of spin along the axis of symmetry z differs 
from zero (5,= ^ 1). In this case, the rotation which we associate 
with the spin is reversed as a result of mirror reflection (see Fig. 
27.8, where the Initial spin is characterized by the rotation I and 
the reflected spin by the rotation /'). 


s s' 



Fip. 27,8. Change of the angular momentum on reflection 
in the AA'B'B plane, which passes; through the axis of sym* 
metry z If the initial rotation characterizing the angular 
momentum occurs m a plane perpendicular to B B (see I), 
the direction of this rotation will be reversed after reflection 
(see I) If, however, the rotation takes place in the plane of 
reflection, it will be unaltered by retlection (II —II) 


The terms whose spin changes on reflection are designated by 
the symbol 

Therefore, the following terms of the ground state of the hydro- 
gen molecule are possible: 

*v‘(A=0, 5 = 0), 

(A = 0, 5=1, 5, = 0), (27.47) 

■^v-(A = 0, 5=1, 5,= } 1 ), 

where the last term is obviously twofold degenerate. 

If the molecule consists of two identical atoms, there will be an 
additional symmetry property. A diatomic molecule with identical 
nuclei must have a center of symmetry, in addition to a plane of 
symmetry. This center of symmetry is the mid-point on the line 
joining the nuclei. In Fig. 27.8, it is located at the origin of the 
coordinate system, that is, at the point 2=0. In this symmetry 
transformation we must change the sign of the coordinates of all 
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the electrons. In particular, under this symmetry transformation 
the positions of electron 1 and electron 2 will be interchanged in 
the hydrogen molecule (the coordinates of the nuclei are left un- 
changed). The symmetric wave function 6^ will remain unaltered; 
that is, it is even (this is denoted by the subscript The anti- 
symmetric function changes its sign; that is, it is odd (this 
is denoted by the subscript «)• The main possible states of the 
hydrogen molecule, taking into account both symmetry properties, 
can be therefore denoted as follows: 


and so forth. 

The importance of symmetry with regard to the formation of a 
molecule follows also from the fact that the ground state of most 
diatomic molecules is a state in which the wave function is invariant 
under all symmetry trans formations.* Thus, is the main term 
of the hydrogen molecule. The question of molecular symmetry, 
however, lies outside the scope of this book. 

It should he noted that in stable states of the hydrogen molecule 
the spins of the two electrons are always oppositely directed. At 
the same time, there are two types of hydrogen molecules— para- 
hydrogen and orthohydrogen. These names refer to the orientation 
of the nuclear spins, and not to the orientation of electron spins. 
In parahydrogen the spins of the nuclei are antiparallel, while in 
orthohydrogen they are parallel. Since the number of possible 
states for two particles with parallel spins is three times larger 
than in the case of particles with antiparallel spins, ordinary hydro- 
gen at room temperature will consist of an equilibrium mixture of 
25% parahydrogen and 75% orthohydrogen. As the temperature is 
lowered in the presence of a catalyst (for example, charcoal), the 
percentage of parahydrogen in the equilibrium mixture increases, 
and is practically 100% at 0®K. Parahydrogen produced at low 
temperatures is extremely stable and can be preserved in such 
an equilibrium system for a period of several weeks at room 
temperature. Orthohydrogen has not yet been obtained in pure 
form. The difference in the thermal conductivities at low tempera- 
tures (the thermal conductivity of parahydrogen is larger) is used 
for determining the composition of the mixture. Similarly, para- 
hydrogen and orthohydrogen have somewhat different dissociation 
energies and optical properties. 

F. THE VALENCE THEORY 

We shall now explain the concept of chemical valence in terms 
of quantum mechanics. By chemical valence we mean the ability 


8 

3V- 


As an exception to this rule, let us mention, for example, the O2 molecule, for which 
IS the mam term 
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of an atom to combine with a specific number of other atoms. As 
already mentioned, the first success of quantum theory in con- 
nection with the chemical properties of atoms was the explanation 
of heteropolar chemical compounds (Kossers theory); these com- 
pounds are formed as a result of the redistribution of electrons 
in the outer shells of the participating atoms. According to this 
theory, the numerical value of the valence is determined by the 
number of electrons which an atom gives up to another atom (posi- 
tive ionic valence) or acquires from another atom (negative ionic 
valence). In the formation of a molecule, the electrons in the 
outer shells of atoms are redistributed so that the valences of 
the atoms are saturated. 

Further progress in the investigation of the formation of a 
molecule was made with the He itler- London theory. This theory 
succeeded in explaining the formation of the simplest homopolar 
molecule Hj, which serves as the basis of our present concept of 
the covalent bond. According to the He itler- London theory, the 
spins of the valence electrons are mutually compensated in the 
homopolar hydrogen molecule. Generalizing these results, it is 
possible to conclude that the formation of homopolar molecules 
occurs under the condition of mutual compensation of the spins 
of the valence electrons. Accordingly, this type of valence is 
also sometimes called the spin valence. 

Since the saturation of valence bonds amounts to a compensation 
of the spins of the valence electrons, the chemical valence of atoms 
is given by the number of electrons with an uncompensated spin 
present in the outer shell. 

To illustrate these general principles, let us consider some 
specific examples. Figure 27.9 gives the ground- state configurations 
of several elements of the periodic system. The electron states 
are shown as boxes, while the electrons are denoted by arrows 
whose directions correspond to their spin orientations. It is clear 
from this figure that the configuration of the outer shell of the 
hydrogen atom (ls‘)^5 corresponds to a single valence bond. The 
valence of hydrogen, which is equal to one, is smaller by a factor 
of one than the multiplicity of its terms, which is two (the multi- 
plicity is designated by the superscript on the left-hand side of 
the term symbol S). 

Similarly, the ground state of the helium atom has the con- 
figuration (2s'^) It is evident that the multiplicity equals one (*5), 
while the valence is equal to zero. 

The boron atom (Z = 5) has the ground state (ls^2s^2/?'), corre- 
sponding to the doublet (^P), and consequently the valence equals 
one. The excited state (ls"2s*2p0» corresponding to the quartet 
(*P) is also possible; in this state the valence of boron is equal to 
3. Thus the presence of several different valences in the elements 
of various groups in the periodic system can be explained in 
comparatively simple form (see Table 27.2). 
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Although according to experiment the elements of oxygen and 
halogen groups can have several different valences, the O and 
F atoms themselves show only the principal valence. This is 
due to the fact that their multiplicity can be increased only if 
an electron is transferred to a shell with a larger value of the 
principal quantum number. This process is unfavorable from 
the energy standpoint (the d subshell is absent in O and F). On 
the contrary, for other elements of these groups there is a pos- 
sibility of transition between states of the same shell, but having 
different values of I 


Is 2s 2p 

H [T^ ' ^ ' [1s’) 

He [tj] Us^] 



(isns^?p') 

{ls‘2s’2p’‘) 

Us^2s^2p->) 

{1s^?s‘2pJ) 

{ls^2s’?p^) 

{ls'2',‘2p-) 

Us^2s‘2p^) 

[ts'2s^2p>>) 


I' lg 27 y. Diagram showing the filling 
of the electron shells of several atoms 
with the spin taken into account Homo- 
polar valence of atoms is denoted by a 
dot, and ionic valence is denoted by 
a { (positive) or — (negative) sign 


Table 27.2 

Multiplicity and homopolar valence 


Group of the 
perioic sys- 

I 

II 

III 

IV 

V 

VI 

VII 

tern 








Multiplicity 

I 

2 

1, 3 

2. 4 

1, 3, 5 

2. 4, 6 

1, 3, 5, ?! 

2, 4, 6, 8 

Valence* 

1 

0, 2 

I. 3 

0, 2, 4 

1. 3, 5 

0, 2,4,6 

1 , 3, 5, 7 


The bold-face type indicates the principal valence. 
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According to the configuration given in Fig, 27,9, nitrogen in 
the ground state (ls^2s'*2p’) is trivalent (the three electrons in the 
2p shell have parallel spins). However, it can also be univalent 
(antiparallel spins of the two electrons in the 2p subshell) and 
even pentavalent (l5^2s'2p^) (the four spin valences, associated 
with the parallel spins of the electrons in the 2s and 2p subshells, 
are augmented by a fifth ionic valence associated with the removal 
of a second electron from the 2 s subshell). In this connection we 
note that the ionic valence of oxygen and fluorine is the same as 
the spin valence (covalence). 

It should be emphasized that, in general, it is impossible to 
divide rigorously the chemical bonds into homopolar and hetero- 
polar ones. The two types of bonds correspond to the limiting 
cases of the electron density distribution in the incomplete shells. 
The limiting case of asymmetry in the distribution of the electron 
density between the atoms corresponds to a heteropolar molecule. 
Such a molecule has a dipole moment and can be regarded as an 
ionic structure. The case of a homopolar bond corresponds to 
identical electron density distributions in the atoms of the mole- 
cule. A homopolar molecule has no dipole moment, and it can be 
considered as a structure formed from two neutral atoms. 

Quantum theory provides a general method for the explanation 
of valence forces and deals with both types of bonds (homopolar 
and heteropolar) in a single scheme. One of the chief merits of 
the Heitler- London quantum-mechanical theory of the molecule 
is that it explains the saturation of homopolar bonds in terms of 
the saturation of the spins of the electron shells when electrons 
combine into pairs with antiparallel spins. When a hydrogen atom 
approaches a Hi molecule no additional pairs with compensated 
spins are formed, and hence there is no gain in energy. The H., 
molecule therefore cannot exist. 

It must be emphasized, however, that the Heitler- London theory 
was developed only for molecules consisting of atoms in the s 
state, and therefore an extension of its conclusions to more com- 
plex atoms must be of a somewhat qualitative nature. Further 
development of the theory of homopolar bonds has shown that in 
the case of complex atoms quantum laws alone are not sufficient. 
In this case we must also Include the influence of the specific 
properties characteristic of the chemical compounds. 


G. MASERS AND LASERS 


The electromagnetic waves radiated by conventional radio transmitters have a com- 
paratively wide frequency band. Everyone who has used a radio receiver knows that 
transmitting stations which have nearly equal frequencies overlap one another. This is 
due to the fact that conventional transmitters have insufficient stability and often “drift” 
into another frequency band. 
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Thus, the stability of even the best existing quartz oscillators is inadequate in a 
number of cases (for example, high-stability oscillators are necessary for accurate 
determination of distances by means of radar). For this reason, one of the great achieve- 
ments of recent times was the development of masers'^ by Townes and coworkers, and 
Independently by Basov and Prokhorov. In this device, quantum transitions between 
discrete energy levels in a molecule are used as microwave generators. 

We note first of all that the process of spontaneous emission, which depends on the 
Einstein coefficient A, has no essential significance in the case of molecular emission, 
since its Intensity is proportional to ta* and is extremely small in the radiofrequency 
region (that is, in the range of frequencies that are low compared with light frequencies). 

As far as induced transitions, which are proportional to the Einstein coefficient i5. are 
concerned, the probability of upward transitions (resonance absorption) and the probability 
of downward transitions (induced emission) are identical (see Chapter 9); they are pro- 
portional to the energy of the field. Therefore, a system (molecule) with two levels will 
undergo a transition from one level to another under the influence of a sufficiently strong 
external field containing the resonant frequency, the transitions being accompanied by the 
emission and absorption of quanta. 



Fig. 27.10. Structure of the ammonia molecule 
showing two mirror-symmetric states a and b 
of the same energy. 

If external radlauon is passed through a substance, it will interact with its molecules 
and cause resonant absorpaon as well as induced emission. In accordance with the 
Boltzmann distribution (the number of particles in an equilibrium system with energy E 

is proportional to there must be fewer particles in the higher energy states 

than in the lower states; therefore, absorption dominates induced emission in the thermal 
equilibrium. The excess of absorbed energy is completely converted into the energy of 
thermal motion of the molecules, raising the temperature of the gas. 

In order for the system to amplify and not absorb the radiation incident on it, it is 
necessary to disturb this thermal equilibrium in such a manner as to produce greater 
occupancy of the higher energy levels than of the lower ones, When this is done, 
such a system will generate electromagnetic waves with extremely small line width 
under the action of resonant radiation. The first masers were produced with ammonia 
(NH3) molecules. The ammonia molecule consists of one nitrogen atom and three 
hydrogen atoms and forms a right pyramid. However, from the laws of symmetry, the 
stable state of the ammonia molecule corresponds not only to the case in which the 
nitrogen atom is located above the triangle composed of the hydrogen atoms (Fig, 27.10a), 
but also to the case in which the mtrogen atom is located at the same distance under the 
triangle (Fig, 27.10bX stable states n and ^have equal energies and correspond to the 


**The tern “masor” is an acronym formed from the first letters of microwave amplifi- 
cation by stimulated emission of radiation. 

^*^Such a state can also be described by the Boltzmann distribution function, if it is 
assumed that the temperature T, which in this case plays the role of a parameter, assumes 
negative values 
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minima of the potential energy; they are thus separated by a potential barrier, so that 
classical theory does not allow transitions from one to the other. In quantum theory, 
however, the probability of penetration across the potential barrier is different from 
zero. Consequently, the nitrogen atom, which Is above the triangle, may reposition it- 
self below the triangle in the absence of any external action, and may then return to its 
original position. In molecular physics this phenomenon is called inversion. The process 
of inversion can be explained in terms of the existence of two perfectly identical potential 
wells, separated by a potential barrier of finite width. It is well known that this leads to 
a splitting of the spectral lines, in spite of the fact that the energy states in the two po- 
tential wells are completely identical to one another. For the ammonia molecule this 
splitting, expressed in wavelengths, is 1.27 cm for the most intense line; this corre- 
sponds to a radiofrequency wave. 

Another remarkable property of the ammonia molecule is that, with the help of an 
electric field, it is fairly easy to separate the molecules in the upper and lower energy 
levels that have been formed as a result of inversion. It is found that when an ammonia 
molecule is placed in an electric field, the two levels are shifted in different directions, 
the upper inversion level being shifted upwards (the energy increases) and the lower 
inversion level downwards. 

Since any system tends to a state in which its potential energy is minimum, we first 
pass a beam of ammonia molecules through a carefully evacuated vessel containing an 
electrostatic field which is produced by a quadrupole capacitor; this field decreases 
towards the symmetry axis (see Fig. 27.1 IX 

In this separating system the molecules in the upper level will tend to move to a 
region where this field is minimum (since the field increases their energy); that is, 
they will be focused about the axis of the capacitor. Ihe molecules that are in the lower 
level, however, tend to move to the region of maximum field (that is, to the periphery, 
where their energy will be minimum), and thus they will be ejected from the beam. After 
passing through the above separating system, the beam of molecules, which contains a 
predominance of excited molecules, enters a cavity resonator which, among its rather 
wide range of radio frequencies (produced by conventional radio frequency methods) also 
contains the resonant frequency of the molecular transitions. 

llus external radiation causes molecular transitions which are associated with a 
line of small width (-v. 10 sec-i) ata frequency of 2.4 x 10^° sec-i (that is, with a relative 
error no greater than 10“^), corresponding to the wavelength of 1.27 cm. This wavelength 
is determined mainly by the time of traverse of the molecules through the cavity resonator, 
A molecular generator produced in such a way exlilbits extremely stable frequency, 
'Ihe frequency stability is so high that molecular clocks constructed on this principle 
have an accuracy of approximately 1 sec per 300 years of continuous operation. i ^ 

On the basis of this principle, several types of radio receivers working in the micro- 
wave region have been constructed. They all use paramagnetic crystals cooled to very 
low temperatures. These devices, known as quantum paramagnetic amplifiers, greatly 
increased the sensitivity of radio astronomical and radar equipment. 

Recently, atoms and molecules have been used as generators and receivers in the 
visible spectrum. In this case, we speak of lasers (light amplification by stimulated 
emission of radiauonX 

What is the difference between an ordinary source of light and a laser? In an ordi- 
nary source of radiation, for example, the Sun, the spectrum consists of a broad fre- 
quency band and the individual incoherent quanta have arbitrary phases and directions. 
Optici lasers enable us to obtain a monochromatic beam of light of high intensity (when 
it is focused we obtain a radiation density a thousand times greater than that obtainable 


^^Thr stability of the frequency in masers made it possible to construct two standard 
independently operating, synchronized clocks These clocks were used to measure the 
velocity of light in one direction under terrestrial conditions and therefore to check the 
fundamental conclusions of the special theory of relatr, ’ty regarding first-order effects 
(proportional to the velocity n ^ < fi). 

We recall that all interference experiments similar to the Michelson experiment involve, 
in effect, only one standard clock and are used to measure the velocity of light over a 
closed path Thus, only second-order effects (proportional to j3^) could be detected in 
these experiments 
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by focusing sunlight), great coherence and extreme sharpness (with telescopic apparatus 
a beam of light reaching the Moon would have a diameter of the order of 3 km). The 
first laser produced by Maxman was a three-level ruby laser. Chromium atoms, which 
are present in ruby (aluminum oxide) as a slight impurity, absorb light from a power 
klystron over a wide band in the green and yellow regions. Initially, these atoms give 
up (without emitting radiaaon) part of their energy to the crystal lattice and make a 
transition to a metasrable state from which dipole transitions are forbidden, and hence 
they can be maintained in that state for a comparatively long time (of the order of 
several milliseconds). 



Kig 27,11 Llectrostatic field of tho 
capacitor in tho separating system 


At a ''negative** temperature (when the number of metastable atoms is greater than 
the number of unexcited atoms and under the action of stimulated emission, such a 
system is capable of generating almost instantaneously monochromatic waves with a 
wavelength of 6943 A (red region^ 'Ibese transitions are induced by the first emitted 
quanta, which are retained in the ruby (the ruby has the shape of a circular rod, bounded 
by silvered parallel ends, one of which is semitransparent jL Photons moving parallel to 
the axis induce the emission of photons having the same frequency and moving in the 
same direction, Ihis chain process intensifies until a coherent ray finally passes through 
the semitransparent mirror (after multiple reflection from the ends). 

The prospects for practical application of lasers are enormous. These devices can 
be used for the simultaneous transmission of various types of information, the establish- 
ment of cosmic communication, the control of chemical reactions induced by thermal 
excitation and so forth. 
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Chapter 28 


Elastic Scattering of Particles 


A. TIME-DEPENDENT PERTURBATION THEORY 

In the case when the Hamiltonian is an explicit function of time, 
it is not, as a rule, possible to obtain an exact solution of the 
Schrodinger equation. If the time-dependent part of the Hamiltonian 
V'(^) is small in comparison with the time- independent part of the 
Hamiltonian time-dependent perturbation theory can be used 
to solve the problem. 

It is obvious that time-dependent perturbation theory can also 
be applied when the perturbation energy is independent of time. In 
this case we obtain the same solutions as in the stationary per- 
turbation theory of Chapter 14, 

Including the perturbation K' (/), we write the Schrodinger equa- 
tion as 


- T == IH* + V' (01 Ht)- (28. 1) 

The solution of this equation in the zeroth approximation (!/'(/) = 0) 
is 

= ( 28 . 2 ) 


where the wave function is a solution of the unperturbed time- 
independent wave equation 

= (28.3) 

and satisfies the orthonormality conditions 

(28.4) 

We shall look for a solution of Eq. (28.1) in the form of Eq. 
(28.2), assuming that the coefficients C„ depend on time: 

This approximate method of solution was proposed by Dirac and is 
known as the method of variation of constants. 
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Substituting Eq. (28.2) into Eq. (28.1) and using Eq. (28.3), we 
obtain the following equation for the unknown coefficients C^: 




- r , 

^ i 










(28.5) 


-E ft 

Multiplying Eq. (28.5) by " , integrating the resulting 
expression over all space, and using the orthonormality condition 
(28.4), we obtain a system of equations for the unknown coefficients 
Cn' 

— (0, (28.6) 


where 


(1) 


n'/i" 



(28.7) 


and the matrix element 


Vn'n"(t)= f ^*n’V'l£i%:d^X. 


Dirac’s system of equations (28.6), taken for all values of n\ is 
completely equivalent to the original wave equation. 

The approximation used in perturbation theory consists in ex- 
panding the solution in the form 

Cn> = Cn' -j- Cn' Cn' . t (28.8) 

where the zero-order coefficients Ch' are independent of V\ The 
coefficients for the first and higher orders will be proportional 
to V', and so forth. 

Substituting Eq. (28.8) into Eq. (28.6) and retaining only the 
first-order terms, we find the following system of equations for 
the coefficients Cn'i 


C° . = 0 (zeroth approximation) , 


(28.9) 

— y (first approximation) and so on. 

n" 

The first of the equations (28.9) shows that the unknown zero- 
order coefficients must be independent of time, that is. 


CJ-= const. 


(28.10) 
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Their values are given by the initial conditions and characterize 
the state of the electron before the perturbation is applied. 

Let us assume that at ^ = /othe electron is in the state n. Then 
we may set 


Cn" ^nn"‘ (28.11) 

This expression specifies the initial conditions of the problem. 
Substituting Eq. (28.11) into Eq. (28.9), we find 

t 

Cn' (0 = - -J- J (0. (28.12) 

^0 

In quantum mechanics one generally calculates the transition 
probability w per unit time. Since the probability of finding a 
particle in the state /i' is given by | Cn^ p, we obtain the following 
expression for the transition probability: 

(28.13) 

n' 

Equations (28.12) and (28.13) are the basis for the investigation 
of many quantum-mechanical problems in the first-order time- 
dependent perturbation theory. 


B. THE CROSS SECTION FOR ELASTIC SCATTERING 


We shall now apply the results obtained above to the study of 
the elastic scattering of an electron. 

We assume that at the initial time ^o = 0 the particle is free; 
that is, it moves uniformly with a momentum p = tlk and an energy 


E 


2niQ 



At the instant of time /o==0, 
the particle comes within the 
range of interactions; that is, 
it moves in a potential V (r). 
The particle now has a finite 
probability of making a tran- 
sition to the state with the 
momentum p* = 1ik and energy 

£'= chK = that is, 


the particle is scattered as 
a result of interaction (Fig. 
28.1). 



Fig. 28.1. Scattering of a particle by a cen- 
ter of force* flk IS the momentum of the 
incident particle, flk' is the momentum of 
the scattered particle, ^ is the angle of 
scattering, 0 denotes the scattering center 
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The wave functions of the initial and final states, describing the 
free motion (in the zeroth approximation), are [see (4.62)] 


^ * * (28.14) 

where L is the period, and the momentum components ki and 
k\ (i=l, 2, 3) are related to the integers ru and n'l by means of 
the expressions 




"zT’ 



(28.15) 


Substituting the wave functions (28.14) into Eq. (28.12), we find for 
the coefficient Cn- 



eh{K‘ — lO ’ 


(28.16) 


where the matrix element of the perturbation energy is [{y' — V (r)] 


Vn — e‘^V (/*) d'x, X = A — 
Hence, the transition probability is 




.1.,, |2 2sin (/C' — AT) 

" ch-iK'-K) ' 


(28.17) 


We note that the function 


t sincf(/C— AO 


(28.18) 


has a sharp maximum for sufficiently large values of t and 
K' — K->0. This means that in practice we can restrict ourselves 
to only those changes of K' for which the following condition is 
satisfied; 


c/|/C' — K!~2ic. 

Since the quantity t = t — ta = M is the time elapsed from the 
initial instant, and the quantity cfl{K' — /()=A£ is the energy 
spread resulting from scattering, we find a relationship between 
these quantities 




(28.19) 
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This relation can be considered as a fourth uncertainty relation; 
it is usually obtained from the theory of transition processes. 

The uncertainty in the energy is characteristic for any wave 
process; its optical analog is the familiar expression for the 
broadening of spectral lines, resulting from the finite duration of 
the emission. 

For sufficiently large values of the uncertainty in 

energy tends to zero and Eq. (28.18) becomes a statement of the 
law of conservation of energy 


K' = K^ 

This explains why this type of scattering is said to be “elastic.’^ ^ 
Mathematically, this follows from the fact that the function (28.18) 
is a S function at / = oo. Integration of the h function leads to the 
replacement of K' by /(. To show this, let us consider the integral 

/ ^ -L ^ F (K') dK'. (28.20) 

0 

Introducing a new variable 




we get 


-ctK 

If the function F has no singularities, we obtain as ^->oo 


-f 00 

J ~^d\ = F(K). 


On the other hand, from the definition of the 8 function it follows 
that 


8 (AT' -K)F (K') dK’ =F(K). 


(28.21) 


^An example of inelastic scattering is provided by bremsstrahlung— a scattering process 
in which an electron emits a photon, so that K K. 
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Hence it is clear that expression (28.18) becomes a 8 function as 
and, consequently, Eq. (28.17) for the transition probability 
can be written as 


w = (28.22) 

k' 

Replacing the summation (28.22) by integration we use, in accord- 
ance with Eq. (28.15), the following relation: 

(^jJ==k’Uk'dQ = k^k'dK'd2. (28.23) 


The scattering process is usually characterized by a cross sec- 
tion, which is equal to the ratio of the probability w to the number of 
particles N incident per unit time on a unit surface S perpendicular 
to the incident beam. Obviously the particles that strike this sur- 
face per unit time are those located at a distance not exceeding 
the velocity of the particles y, that is, the particles contained in 
the volume vS = v. The number N is equal to the number of parti- 
cles per unit volume po = f' ^ multiplied by a volume which is 
numerically equal to the velocity of the particle 


N = 



ft 

fto’ 


(28.24) 


With the aid of Eqs. (28.22) - (28.24), we find the following 
expression for the scattering cross section: 


o = ~ = |o(», y)da. (28.25) 

The integrand characterizing the number of scattered particles 
incident per solid angle d^(d2 = sm^d^ where ^ and y are the 
spherical scattering angles), known as the differential cross sec- 
tion, is equal to 


= (28.26) 

In particular, when the scattering center is spherically symmetric, 
we have 


CO 

s 


where dQ’ is the solid angle associated with the vector r, whereas 
in Eq. (28.25) d2 is the solid angle associated with the vector A'. 
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Integrating the last expression over the solid angle we 
find 

oo 

1/^= ^ r sin yr V (r) dr, 

0 

From this it is clear that the differential cross section of the 
elastic scattering is equal to 

o(»)=l/(»)r. (28.27) 

where 

x= lA — *'l = 2^sin-2-, (28.27a) 

and the quantity 

oo 

/ W = — ' j /' sin xr • F (r) dr (28. 28) 

is called the scattering amplitude. 

Equation (28.27), describing the elastic scattering of the par- 
tides by a center of force V {r) in first-order perturbation theory, 
was originally developed by Born; it is therefore called the Born 
approximation. 

We note that this problem can be also solved in the time- 
independent perturbation theory, since the potential energy of in- 
teraction is time- independent. To obtain the scattering cross 
section we have used, however, the time-dependent perturbation 
theory, the mathematical apparatus of which is comparatively 
simple but is much more general. In particular, it can be used to 
solve many problems in modern quantum electrodynamics, taking 
account of the interaction of electrons with the doubly quantized 
electromagnetic field (see Chapter 29). 

The expression for a(0), obtained from the perturbation theory, 
is applicable only within some definite limits. In the case of short- 
range forces (nuclear forces, neutral atom, impenetrable spheres, 
and so forth), which can be neglected for r > /?, where /? is a cer- 
tain effective radius, the magnitude of the cross section must be 
either less or of the same order as their geometric cross section 
(even if these forces create an absolutely impenetrable barrier). 
For short-range forces, therefore, the range of applicability of 
the perturbation method is given by 


(28.29) 

Equation (28. 27) is not applicable to long-range forces (Coulomb 
forces) if the scattering angle is small. This question requires 
a more detailed analysis (see below). 
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Co SCATTERING BY THE YUKAWA CENTER OF FORCE 

The form of the potential energy for the Yukawa Interaction is 
as follows: 


V = 



(28.30) 


where A is a constant related to the charge and is the effec- 

tive range of these forces. The interaction (28.30) has some im- 
portant applications in the theory of nuclear forces and, in partic- 
ular, in meson theory. 

For nuclear forces, the quantity A is equal to where g’ 
determines the strength of the potential. The range of nuclear 
forces is equal to the Compton wavelength of a pion 


R = 10 cm. 

m^c 


(28.31) 


In the case of fast-electron scattering (or a -particle scatter- 
ing) by neutral atoms, the interaction potential given by the Thomas- 
Fermi model can be approximated by the expression (28.30).^ In 
this case the quantity A = Zel, where Z is the atomic number, and 
the effective radius R of the atom in the Thomas- Fermi model is 
equal to 



(28.32) 


where 7 is a niunerical factor of the order of unity. 

Finally, setting JR— ^oo, we obtain the Coulomb potential of the 
nuclear field which, consequently, can be considered as a limiting 
case of Eq. (28.30), 

Substituting Eq. (28.30) into (28.27) and using the relation 


CO 00 

I r sinxr V {r)dr = — A ^ sin vr dr =■ — A » 


we obtain the following expression for the differential cross section 
of elastic scattering; 


a(^) = 




(28.33) 


^As has been mentioned in Chapters 25 and 27, a more accurate approximation of the" 
Thomas-Fermi potential is given by the expression (25 22) The results of the two approx- 
imations, however, do not differ greatly from one another (which is a consequence of the 
short-range character of the forces). The approximation (28 30) is more convenient in 
calculations of scattering. 
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Here, according to Eq. (28.27a), 


where p is the momentum of the particle. 

Two cases should be distinguished in the analysis of Eq. (28.33). 
1. Scattering of relatively slow particles, when for all 

scattering angles. In this case Eq. (28.33) shows that a (^) is inde- 
pendent of the angle ^ and becomes equal to 




(28.34) 


The independence of the scattering cross section of the angle 0 
(isotropy) is a characteristic feature of scattering of relatively 
low-energy particles by a center of short-range forces. 

2. In the case of relatively fast-particle scattering, the differ- 
ential cross section becomes independent of R (the effective range 
of force) for all scattering angles satisfying the condition x7?V 1. 
Equation (28.33) in this case reduces to 

a(»)=igll. (28.35) 

It is evident that for angles satisfying this condition the scattering 
by the Yukawa potential is the same as the scattering by a Coulomb 
center. Therefore, in the case of fast electrons or oi- particle 
scattering by a neutral atom through comparatively large angles, 
the influence of atomic electrons is not important and the scatter- 
ing is determined only by the potential of the nucleus. 

Setting A=Zel and x = -^sin -2 in Eq. (28.33), we obtain the 
familiar Rutherford equation 




4p* sin^ Y 


(28.36) 


which was obtained by the classical method in Chapter 2. Equation 
(28.36) shows that for long-range forces there is a strong de- 
pendence of the scattering cross section on the scattering 
angle ^ . 

However, for any large values of the wave vectors £ we 

can always find small angles ^ such that the following inequality 
is true: 
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In particular, as ^—*0, Rutherford’s formula gives a divergent 
value for o(0); in this case we must take into account the short- 
range character of the forces resulting from the screening action 
of the electron shell. The condition (28.37) now determines the 
region of inapplicability of the Rutherford formula. 

For 0 = 0, that is, for forward scattering (x = 0), we find from 
Eqs. (28.32) and (28.33) the following expression for the differential 
cross section: 


a(0) = 47*aJZ*/» (28.38) 

d-^o 

The total cross section can be obtained from Eq. (28.33) 


C sin 

n* ] — cos»)P 

0 

_ I 

n* Ak^R^ + 1 • 


(28.39) 


Finally, with the aid of Eq. (28.29), we can find the range of 
applicability of the perturbation method for our problem in the two 
limiting cases considered above 


■4 tor I. 


(28.40) 


problem 28 .l 5 jRepresent the cross section for scattering of particles by a spherically 
symfliLLiiL pULGifflal as a sum of partial cross sections (the sum of the cross sections 
for waves with a well-defined value of the orbital quantum number / ), Obtain the scatter- 
ing cross section for the general case and for the Born approximation. 

Solution, Suppose the incident particle has a momentum p = 7lk and velocity 
v=:i1iklmo^ directed along the z axis. 

The wave function of the incident particle is of the form of a plane wave 

d/ — pikz 

nnc — ' 

A plane wave can be expanded in spherical harmonics (see Problem 21.5). Then, using 
the asymptotic expression for the Bessel function as r —^oo 


J 

H- 



the Incident wave can be represented as 


^ sin (kr — 

( 2 ^ + >) Hr • («“»)• 

2 — 0 


(28.41) 
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In the presence of the potential energy l/(r), the asymptotic expression for the wave func- 
tion of the particle in a centrally symmetric field, in accordance with Eqs. (13,75) and 
(13,78), should be chosen in the form 


sin [ftr — +S/] 

^ C, Pi (co,9) 


j = o 


where the phase shift can be determined from the asymptotic solution of the Schrbdinger 
equation for the radial function In the presence of the potential V (r) 

Clearly the scattered wave is 


'l^scat 'l^inc 


iVi 

-i(kr-±') \ 


The unknown coefficients C/ can be determined from the condition that the function 'J^scat 
must be a diverging spherical wave. Thus the coefficient of the converging wave 

£ ^ ' must be equal to zero, Ihen 


rscat 




The function /(O) is the scattering amplitude [see (28,28)], which, according to the 
exact theory, is equal to 


= I)P/(cos9). (28.43) 

i=K) 


The differential cross section characterizing the scattering of particles through an 
angle § is equal to the ratio of the probability of scattered particles passing per unit 
time through an element of the spherical surface dS = r^dQ: 

^«^scat=«4|cat I” 

to the number of particles incident per unit time on a unit surface perpendicular to their 
velocity, that is, perpendicular to the z axis 

From this we find the differential cross section 

do = = (/(«) |> 2a sin»rf». (28.44) 

'^inc 

Here, assuming the scattering field to be axially symmetric, we set the solid angle 
equal to 


dQ = 27 c sin Sdd. 
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Substituting the value obtained for the scattering amplitude and using the orthonormality 
property of Legendre polynomials, after integrating over the angles we get 

n 

f P, ( cos 9) Pf ( cos 8) sin . l .,, , 

o*' 1 “ ’ 

which gives us the following expression for the total cross section: 

00 

<J = ^2I(2/+l)sin*^. (28.45) 

/=0 

This expression for o is the desired sum of partial cross sections. 

Let us compare the expression for the scattering amplitude found in the Born approx- 
imation [see (28.28)] with the exact expression (28,43)1 The comparison shows that the 
exact expression gives the same result as the Born approximation for small values of 
the scattering phase angles 8/. Indeed, when 8/ 1, expression (28.43) becomes 

00 

Solving Eq, (28,42) for a given I (for a partial wave) by perturbation methods, we can 
show that 

00 

- Tr I" V(r)rP , (kr)dr. (28.46) 

"I (fg- 

0 


Next, using the expansion 

sin /r 
xr 


00 

2-^y(2(+ l)P^_^(ftr)P,(cos8), 


where 


-/, = 2k sin 


{V 

2 ’ 


the scattering amplitude (28.45a) can be reduced to the form (28,28) found in the Bom 
approximation, 

^ "I— 

^groblenn^28^2»irf^^®^^i^® th® cross section for scattering of particles by a spherically 
symmelH'c potential barrier of height l/o> 0 and radius a 

Vo for r<:a, 

\ 0 for r > a, 

when the radius a is much less than the de Broglie wavelength of the scattered particles, 
that is, when ka^ 'l. 

Show that in this case the s wave (wave with / — 0) is the main contributor to the 
scattering process. 

Compare the exact solutions with those obtained by perturbation methods. 

Solution, It may be seen from Eq. (28.46) that at ka 1, the s wave (/ = 0) is the main 
contributor. 

Solving the problem by perturbation methods (Born approximation), that is, using 
Eq, (28.28), we find the scattering amplitude 


/(») = - 


2moVo 


5 - 


2/;»oVo Q=> 

3h-’ * 
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Consequently, the cross section in the Bom approximation is equal to 





sin^|/(d)|2 = 


I6;r 


n* * 


(28.47) 


Let us determine the scattering phase shift in this simple problem. We restrict 
ourselves to a determination of the phase shift for / = 0 and low energies when we can 
set ka < 1. 

When / = 0, the wave equations have the form 


where 


+ = 0 for r>a, 

z; — !t'Vo = 0 for rca, 


(28.48) 


Zo = ^or, = 

Using the boundary conditions Xo(0) = 0, the solution of Eq. (28,48) can be represented 
as 


y.o 


=1 


A sinhx'r 
sin (kr + 5o) 


for rCa, 
for r:>a. 


Equating the wave functions and their derivatives at the boundary of the region r = a, we 
find in our case of small t (E < Vo) 


5o = arc tan 


J- tanh 


-ka^ka 


f tanh xa 




%a 


)■ 


(28.49) 


where 


X(2 



a. 


Hence, according to Eq, (28.45), we have 


In the case'^ 


a = 


( tanh xfl 
%a 



I 


(28.49a) 


(28.50) 


we can set 


t anh xfl j 


Then Eq, (28.49) gives an expression for a corresponding to the Born approximation 
[see (28.47)]. 

Whenxa > 1 (that is, when Vo — oo) the cross section reaches its maximum value 

a = 4710*. (28.51) 


^We note that the condition (28.50), which in this problem can also be written as 

Ob ^ . 

IS equivalent to the condition (28.29) for the applicability of perturbation method 
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The last expression is four times greater than the corresponding cross section for 
elastic scattering by an impenetrable sphere, calculated according to classical mechanics 
when '3 is determined simply by the geometric cross section of the sphere This 
discrepancy Is due to the appearance of wave properties (dlffracuon) of the scattered 
particles. 

^TPro blem 28^ Determine the cross section for scattering of slow particles I) 
byTTSphei ILlUI^ymmetric potential well: 


i — Vo for r < a, 
\ 0 for r>fl. 


(28.52) 


Indicate what distinguishes the cross section for scattering of parades by a potendal 
well from the cross section for scattering by a potendal barrier. 

Solution, In the exact solution of the problem, Eq. (28,48) should be replaced by 

y: + k‘/o=-0 for rx/, ,28 53 ) 

Z; + ft's/o = 0 for rc a, 

where 




The phase shift 8© is given by the following expression [instead of Eq, (28,49)]: 


fto = arc tan 



(28,54) 


Wlien k'a<^ 1 and Vo, we again obtain Eq, (28.47), which was derived in first- 
order perturbauon theory. In tliis case the cross sections for scattering by a potential 
well and by a potential barrier are identical. 

ITie difference appears when the quantity xa ~ Vo^J^^comes comparable to or 
, „„ , , . ' * , / . tanh xa 

greater than unity. Ihus, in the case of a potential barrier, the quantity - ^ monoton- 
ic ally approaches zero as l^o increases, whereas in the case of a potential well the 

lank' a | 

corresponding quantity vanes periodically over the range from 0 to oo as 

Vo Increases, 


In particular, if the quantity k’a approaches ^ , we find the following expression for 
k'(i ^ 

the cross section in the region 

tan k’ a 


_ W 
^res“ 


(28,55) 


Because of the smallness of the quandty a*, the expression for a is much 

greater than the maximum value for the cross section in the case of scattering by a 
potential barrier (^rnax == when Vo— *^oo [see (28,51)]), Since the relation = 

= = — for fta < 1 is actually equivalent to the condition of appearance 

of the first level in the spherically symmetric potential well,xfl ^ ^ the cross section 

(28,55) corresponds to the case of "resonant'' scattering. Subsequent resonance maxima 

3 5 

of the cross section occur when y n , and so forth. 

Note. The scattering phase shift, together with the expression for the cross section, 
can be accurately determined for a very limited class of problems (scattering by a 
spherically symmetric potential barrier or potential well, scattering by a potential in- 
versely proportional to the square of the distance, and so forth). However, in the general 
case we must use approximate methods. For intermediate values of the potential energy V© 
(for instance, in the case of scattering of charged particles by a Coulomb field the constant 
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characterizing the potential energy must satisfy the condition 1) perturbation 

theory (that is, the Bom approximation) gives good results. In cases when the Bom 
approximation is no longer applicable, the phase can be determined by other approximate 
methods, for example, the WKB method [see Eq. (13.78)], the variational method and 
so forth. In view of the special nature of these methods we shall not discuss them In this 
general treatment.'* It should be noted that many qualitative features, which are char- 
acteristic of the scattering of particles by various potentials, are very well illustrated 
in the scattering by a potential barrier or a potenaal well. 


'*The theory of scattering is given a more extended treatment in N. Mott and G. Massey, 
The Theory of Atomic Coll is ions, Hew York. Oxford, 1949, L. Schiff, Quantum Mechanics, 
New York* McGraw-Hill, 1955. 



Chapter 29 


Second Quantization 

A. SECOND QUANTIZATION OF THE SCHRODINGER 
EQUATION 

As an example of second quantization, we shall consider the 
nonrelativistic Schrodinger equation, and then generalize the re- 
sults to the case of the Maxwell and Dirac relativistic equations. 

In Chapter 5, it was shown that the solution of the time-dependent 
Schrodinger equation in the general case can be represented in the 
form 


n 



(29.1) 


where the coefficient C„(t) , which characterizes the probability 
that a particle is in the state represented by!p„, includes the time- 
dependent part of the wave function 


C„(0 = C„e (29.2) 

Here the quantities are the energy eigenvalues, and the eigen- 
functions of the time- independent Schrodinger equation satisfy 
the orthonormality condition 

K (29.3) 

In transferring from classical quantities to quantum-mechanical 
ones, the position .v, the momentum and the remaining quantities 
are replaced by their average values 

X— ^ <}i* (/).V'l<(0d'-v, 

/A= 5 


(29.4) 
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The time variation of these average values cannot be determined 
from the classical Poisson brackets, which in this case coincide 
with Hamilton’s canonical equations 


dxQi dH 

dt ^Pc\ * 

and must be calculated from the commutator relation 

R = (29.5) 

Here the Hamiltonian is equal to 

+ ( 29 . 6 ) 

The quantum equation of motion [Eq. (29.5)] can be considered 
as the fundamental equation describing the quantization of the 
classical equations of motion. This process is called first quanti- 
zation. In order to connect p and x by the usual relation 



we must assume that in Eq. (29.5) the operators p and r do not 
commute with each other and obey the commutation relation 

p.v_Ap = 4- (29.8) 

Thus the commutation relation (29.8), which is basic to the 
Schrbdinger theory, can be regarded as a consequence of the 
quantum equation of motion (29.5), and the transition from the 
classical equation of motion to the Schrcdinger equation is equiva- 
lent to a transition from corpuscular concepts to the wave con- 
cepts. 

In order to include in the theory the corpuscular properties 
of the de Broglie waves, it is necessary to introduce a number 
of additional hypotheses (for example, a hypothesis concerning 
the probabilistic nature of the ^ waves and the meaning of the 
coefficients 1 as the probability of an electron being in state 
n , and so forth). 

The process of second quantization enables us to take into 
account both the corpuscular and wave properties of the particles. 

The name “second quantization” originated from the fact 
that in this case we quantize the equation which has already been 
quantized as a result of the first quantization. We note that as a 
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result of second quantization, the coefficients €„ become operators 
(Q numbers), whereas in the Schrodinger theory they remained 
ordinary constants {c numbers). 

In order to carry out the second quantization, it is necessary 
to find the average value of the energy operator 

(29.9) 

Substituting (29.1) in place of <}^(/) and recalling that = we 
find 


n, n* 

Taking into account the orthonormality condition Eq. (29.3), we 
obtain 


H=='2Efil{t)CAt\ (29.10) 

n 

We note that the last expression is independent of since it 
is apparent from the equations 


C„(/)=C„e » , 


Cl{t) = C*J 


(29.11) 


that the time factor in the product Cn{t)-Cn(t) is simply equal to 
unity 

Cl(f)CAt) = OnCn^ 


Expression (29.10) is not an operator in the Schrodinger theory, 
since the coefficients are ordinary numbers (c numbers). In the 
theory of second quantization, however, these quantities and the 
Hamiltonian (29.10) should be regarded as operators, that is, as 
q numbers. 

To find the commutation relation for the coefficients Cn , we 
must substitute C^(0 and Cl{t) in place of x and pin the quantum 
equation of motion^ (29.5), and the operator H must be replaced by 
its average value //. We get 


^=1(//C„(0-C„(0//). 


According to Eq. (29.11), 








(29.12) 
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and therefore from (29.12) we find the following relation, which is 
a fundamental postulate of the second quantization: 


Substituting H from Eq, (29.10), we obtain 

- - C^€*Cn)- 

n 


(29.13) 


(29.14) 


The last equation has two solutions , corresponding to the Bose 
statistics and Fermi statistics, respectively. The first solution is* 


c„.c„-c„c„.=o. 

C„.C*-C;A- = 8nn'. 


(29.15) 


This can be easily verified by substituting the expressions 


C„Crt' : 


= C„A. 

= CA' + 8« 


(29.16) 


into the first and second terms, respectively, on the right-hand 
side of Eq. (29.14). We then obtain the identity 


-£.C,. = -2 




From Eq, (29,15), it follows that the coefficients and C% are 
operators. Setting 


CIC, = N, (29.17) 

where N is the number of particles in the state n , we find from 
Eqs. (29,15) 


C„Q=l-f iV. (29,18) 

It follows, therefore, that these operators do not vanish even in the 
case where there are no particles present. Indeed, even though 
= U when A/ = 0, we still have in that case C„CJ = L This 
nonzero value for the combination of coefficients C gives rise to 
a relationship between the vacuum (the field of virtual particles) 
and the real particles, 

^It follows from Eq (29 15) that the wave functions \}j{r, t) and £) commute, while 

ip and ip* do not commute 

ip(r, t) ip*ir, t)-\p*{/, 0 0(r, 0 - r) 
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It is clear from the above equations that the operator CJ should 
be regarded as the “creation” operator for particles, and C;, 
as the “destruction” operator. If at the initial time there are no 
particles present, the condition C„CJ=1 signifies that a particle 
can first appear (owing to the action of operator CJ), and then 
disappear (owing to the action of C^). The condition =: 0 im- 
plies that in the absence of particles the inverse process, in which 
a particle is first absorbed and then emitted, cannot occur. 

In the solutions (29,17) and (29.18), the quantity N must be a 
positive integer. It is not restricted to any maximum value. 
Therefore, these commutation relations correspond to the Bose 
statistics, which allows any number of particles to be present in 
one state. 

The second solution of Eq. (29.14) can be represented as 


-f- CfiCn^ = 0, 


(29,19) 


which is easily verified by the direct substitution of Eq. (29.19) 
into Eq. (29.14). Setting 


C*C^ = N. (29,20) 

we find 

C„C*=\ — N. (29.21) 

Noting that CnC„ and C„C* cannot be negative, we find that the number 
of particles in the state n can assume only two values: N = 0 and 
A^=l. Consequently, the Pauli exclusion principle is already in- 
cluded into this solution and the particles obey the Fermi statistics. 
In particular, if there are no particles present initially (A/^ = 0), then 
just as in the case of the Bose statistics we have 

C*C„=:=0. (29,22) 

The equation obtained in second quantization describes, there- 

fore, a state with a variable (integral) number of particles. Con- 
sequently, electrons will be similar to photons not only because 
their motion is described by a wave equation, but also because 
electrons, just like photons, can be created and destroyed. 

Since the creation and annihilation of electrons requires an 
energy more than twice as great as the rest- mass energy of an 
electron (since an electron is always created together with a 
positron), the second quantization of the nonrelativisticSchrodinger 
equation is of purely methodological significance. In order to 
consider real wave processes associated with the creation or 
annihilation of particles, one should extend the method of second 
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uantization to relativistic equations— namely, the wave equation 
or photons (Maxwell’s equations) and the relativistic wave equation 
3r electrons and positrons (Dirac’s equation). 


B. QUANTIZATION OF MAXWELL’S EQUATIONS 

It is well known that the photon field (electromagnetic field) can 
e described by a vector potential which satisfies the d’Alemberts 
quation 


= (29.23) 

nd is subjected to the following condition: 

V i4==0. (29.24) 

Since the vector potential A is a real quantity, we can represent 
he solution of (29.23) as 

(29.25) 

X 


^here 

’^ = 1^1, and It, = (/?^==0, =t:l,d:2, rb3...). 

The condition (29.24) means that the vector a is perpendicular 
o vector x , that is. 


x.a =0. 


(29.26) 


The normalization coefficient is determined from 

he condition that the energy of the electromagnetic field 


« = i S !(£)■ + = i S { (7 s)’ + 
+ ( V . /!)*} = 21 4’'- U«* ■ a) + («•»» *)1 


(29.27) 


nust be equal to the sum of the products obtained by multiplying the 
mergies of the individual particles by the corresponding squared 
.mplitudes. 

Since the Hamiltonian is proportional to the combinations of 
coefficients of the formC’^C + CC* (the amplitudes of a in this case 
)lay the role of the coefficients C), the quantum equation of motion 
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(29.13) for the field of photons permits only a solution corresponding 
to the Bose statistics.^ Furthermore, taking into account the con- 
dition (29.24), we obtain the following commutation relations: 

a, a? atUi =x: Au> = hi' — • (29.28) 

where = 2, 3 and the amplitudes and refer to the same 

vector X , which plays the role of the quantum number n [see Eqs. 
(29.15)]. The coefficient Aw expresses the transverse character of 
the photon field 

%^Aw = Xi>Aw=:0. (29.29) 

In particular, if there are no photons present at the initial 
instant, only the following bilinear combination of amplitudes will 
be nonvanishing: 


a,a? = S« 


(29.30) 


C. SPONTANEOUS EMISSION 

In Chapter 9 we found the Einstein coefficients A for the spon- 
taneous emission by using the correspondence principle. In such 
treatment, the reasons for transitions of electrons from higher to 
lower levels remained unexplained (see Chapter 9). Quantum 
electrodynamics (the name given to the theory that includes second 
quantization of the electromagnetic field) explains the transitions in 
terms of the interaction of an electron with the field of vacuum 
(virtual) photons. 

As can be seen from Eq. (29.30), there are quadratic com- 
binations of the quantized amplitudes of the electromagnetic field 
which differ from zero even in the absence of real photons. 

The time- dependent Schrbdinger equation, taking into account the 
doubly quantized field of photons, can be represented as 

Neglecting the second-order terms proportional to A^, and taking 
into account the transverse character of the electromagnetic waves 

= (29.32) 

these amplitudes occur in the Hamiltonian in the combination C*C ~ CC* (this 
happens, for example, for Dirac particles), then only the Fermi statistics would hold for 
the corresponding amplitudes This is easily verified by substituting into the Hamiltonian 
the different solutions for the Bose statistics [see Eq. (29.16)] and for Fermi statistics 
[see Eq. (29.19)]. 
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since 




Eq. (29.31) can be reduced to 

[- f I - ^ (29.33) 


Here the unperturbed Hamiltonian is time- 

ZfflQ 

independent, and the operator of the perturbation energy is 




Cttlo 


At) 


Let us assume that the electron is initially in the state n . Then 
under the influence of virtual photons ^ 


V'(t) = — -—A-p= i/— Xi / 'tSl 


fl*. p (29.34) 


the electron can jump into the state n ' . 

In accordance with Eq. (28.12), we obtain the following ex- 
pression for the coefficient Cn'(t), characterizing this transition: 


C „.(0 


L^/ittla ^ Y I — to) 

X 


a*-Pn'n, 


where the matrix element 


(29.35) 


Pn-n = J (29.36) 

and cD = cx is the frequency of the emitted photons. 

The quantity j is small since the wavelength of the emitted 

light is 10 ® cm, and the dimensions of the atom are of the order 
of 10 ^ cm. Therefore, in a first approximation, the exponential 
factor in Eq. (29.36) 

1 — /x*r + . . . 

should be set equal to unity. Such transitions are called the dipole 
transitions. ^ 


^Taking into account the commutation relations in (29 30) in the expression for the 
vector potential A in the case when there are no photons, we should retain only amplitudes 
proportional to a*, that is, operators for the creation of particles 

^If we include the next term in the expansion we obtain quadrupole radiation. Quadru- 
pole radiation is (r/A)^ times weaker than dipole radiation and is of importance only when 
dipole transitions are forbidden. 
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The probability of a spontaneous transition from the energy 
level n to the level n! is equal to 


V 

““ L^ml L 


2ti ( 
^0) " 


(/)C„. (0 = 

(a-p%'n){.a*- pn'n). 


dt 

sin t — a>) 


(29.37) 


Here instead of two sums over the wave numbers of amplitudes a 
we left only one, since the only combinations of amplitudes a and a* 


Of, a? 


^tv — 


different from zero are those which refer to the same momentum x. 
Using the last commutation relations, we find 

(a.pn'n) (a*, pn'n) === i T- 


Let us replace the sum over x by an integral in Eq, (29.37) [see 
also Eq. (28.23)] 


7>-2~5K' .( ‘'■—-W J 

and include the fact that for sufficiently large values of time t , we 
have, in accordance with (28.20), 


Tt 


sin^((o— 


(O) — ii)nn'y 


(29.39) 


Then in the case of spontaneous transitions Eq. (29.39) reduces to 
the law of conservation of energy 


W = <i)nn' 



(29.40) 


since from Eq. (29.40) it follows that the energy of the emitted 
photon hii) is equal to the energy (£„ — En' ) lost by the atom as a 
result of transition. For the probability of spontaneous emission, 
we obtain 


' \Pn'n I 


'l-Ehc^ml 




sln^dQ. 


Evaluating this integral and remembering that = — tniQu^nn'rn'n, 
we obtain the final expression for the probability of spontaneous 
emission 



SECOND QUANTIZATION 


489 


which was already derived in Chapter 9 with the aid of the semi 
classical correspondence principle [see Eq* (9.20)]. 


D. BETA DECAY 

As another example of the application of second quantization 
let us consider the theory of )3 decay. This phenomenon consists 
of the emission of an electron (positron) by a nucleus, leading to 
an increase (decrease) of the nuclear charge by unity. 

The theory of p decay, which resembles in some respects the 
emission of photons by atoms, was constructed on the basis of 
second quantization. 

There are no photons in an atom. A photon is created from the 
vacuum when an atom makes a transition from one energy state to 
another. Similarly, a nucleus does not contain electrons; they are 
created only in the process of p decay. 

As a result of experimental investigation of p decay it has been 
established that these electrons have a continuous spectrum bounded 
by a certain maximum energy equal to the difference in the energy 
of the nucleus before and after the decay. It was also established 
that in p decay the angular momentum of the nucleus changes by 
a multiple of H , whereas the angular momentum carried away by 
the electron equals 

The apparent violation of the laws of conservation of energy and 
angular momentum in p decay was resolved in the hypothesis of 
Pauli, who assumed that the emission of an electron is accompanied 
by the emission of another particle—a neutrino possessing half- 
integral spin and a rest mass close to zero. 

According to the Fermi theory, constructed on the basis of this 
hypothesis, p decay should be considered as the transformation of 
one of the nuclear neutrons (n) into a proton (p), an electron (e~), 
and an antineutrino (v) 

|-v. 

Similarly, the emission of positrons in p decay should be con- 
sidered as the transformation of a nuclear proton^ into a neutron, 
a positron and a neutrino ^ 

p-^n + 


^We note that since the rest mass of a neutron is greater than the total rest mass of the 
proton, electron, and antineutrino, it follows that the decay of a free neutron should also 
be observed. The decay of a free proton appears impossible from the energy standpoint and 
therefore, positron decay can be observed only in a bound proton, when the required energy 
can be taken up from the nucleus 

^We shall explain the difference between the neutrino and antineutrino at the end of the 
present chapter in the discussion of the nonconservation of parity 



490 


FUNDAMENTALS OF NUCLEAR PHYSICS 


Furthermore, the capture of a boimd electron is also possible; as 
a result of electron capture a proton is changed into a neutron and 
emits a neutrino (p + e -- n + ^) • As a rule, an electron from the K 
shell is absorbed in this process, and therefore this phenomenon is 
called K capture, /Ccapture is similar in nature to positron jS decay, 
since in both cases the charge of the nucleus is reduced by unity. 

We shall not consider here the details of decay; our task 
will be to describe in general terms the creation of an electron 
and antineutrino following the Fermi theory. 

The energy of interaction of a neutron with the electron- 
antineutrino field can be written as 




(29.42) 


where f is a coupling constant introduced by Fermi. The magnitude 
of / is very small (/?^ 1.4 x 10"“^^ erg x cm^ ) so that this inter- 
action is called a weak interaction. The spontaneous decay of 
particles is caused mainly by weak interactions; therefore, the 
lifetime of elementary particles or nuclei is comparatively large 
and varies from a fraction of a second to billions of years. Nuclear 
processes, on the other hand, are caused by strong interactions, 
which sometimes are a thousand times larger than the electromag- 
netic interactions. The duration of the processes caused by such 
interactions is very short (of the order of 10“23 sec). 

Neglecting spin effects, the wave functions and can be 
represented as 

^ (29.43) 


K there are no particles present at the initial instant, the following 
relations hold for the amplitudes a* and 6*: a*a = ^*6 = 0, an* = 

Then, according to Eq. (28.12), we have the following expression 
for the coefficient C{ty. 


C{t) = - V,„a*b* J dte-i^t^Xn-Kp-K-n . (29.44) 

0 

Here E„ — cHKn is the neutron energy and Ep = cfiKp is the proton 
energy. The matrix element of the interaction V'pa is 

Vpn= (29.45) 

where is the wave fimction of the neutron, Xp is the wave function 
of the proton, and t is the Dirac matrix which determines the 
nature of the interaction. Since the interaction energy must be 
a scalar quantity, an analogous matrix should also relate the spinor 
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amplitudes of the wave functions of the electron and neutrino in Eq. 
(29.42). However, the influence of the specific choice of the Dirac 
matrices manifests itself only in the spin effects, which are ne- 
glected in the present treatment. 

Just as in the investigation of dipole radiation, the quantity in 
the exponent in the matrix element (29.45) is much less than unity, 
that is, inside the nucleus 

(^ + »^)-r-|(^ + x)./?|<l, 

where R is the nuclear radius. The exponential, therefore, may be 
expanded in a series ^ 

I _ I r+ . . . 


If the matrix element (29.45) does not vanish upon replacement 
of the exponential by unity, the corresponding transitions are said 
to be allowed transitions* 

We note that the allowed transitions, which correspond to the 
dipole transitions in the theory of photons, are associated with 
definite selection rules. ^ 


^In the case of vector interaction, the matrix element 

(the fourth component of the four-dimensional velocity, the first three components for the 
nucleon at rest vanish) will differ from zero (allowed transitions) if the spin of the 
nucleus (that is, the total angular momentum) remains unchanged in the /? decay (AJ - 0, 
Fermi transitions) This Fermi selection rule is satisfied for the majority of nuclei. 
There are, however, some cases of allowed decay, for example, 

2He 3L1 + c +1/ 

in which the spin of the nucleus changes by unity (AJ - 1) The spin of the 2 ^ 0 ^ nucleus 
is equal to zero (one alpha particle plus two neutrons with antiparallel spins), while the 
spin of the 3 Li^ nucleus is equal to unity (one alpha particle plus one proton and one 
neutron with parallel spins, just as in deuterium). 

In order to explain these selection rules, Gamow and Teller pointed out that when we 
form the interaction energy operator, which is a scalar (V - Vp^V'^i^), one may take a 
product of two vectors, as well as other relativistically invariant combinations of the four 
wave functions For example, we may take a product of two pseudovectors, which is also a 
relativistic invariant The pseudovector interaction 

'pn - 

leads to the selection rules A J - 0, It should be emphasized that for a nucleon at 
rest, the first three components will be different from zero 

In view of the fact that the pseudovector interaction forbids 0 0 transitions, which 

were nevertheless observed experimentally, as well as for several other reasons, present- 
day theory uses a combination of vector and pseudovector interactions (as in the Feynman 
and Gell-Mann version of the theory of (3 decay). This version enables us to explain the 
basic experimental data obtained in the investigation of /? decay. 



492 


FUNDAMENTALS OF NUCLEAR PHYSICS 


In the subsequent discussion, we shall restrict ourselves to the 
treatment of allowed transitions; in this case it is sufficient to make 
the exponential term in the matrix element (29.35) equal to unity. 
We then obtain the following expression for the coefficient Ci 




flL^ 


Vpna^b^ 


C(K+'^-Knp) * 


(29.46) 


where l^o~(^^Knp = ch(Kn — Kp) is the energy carried away by the 
light particles (electron, neutrino or antineutrino) in jS decay. 



r’lg. 29.1 Energy distribution of electrons in 
fi decay (according to the Fcrnu theory), 

Eq is the maximum energy of the 0 spectrum 
The origin of the coordinate system corresponds 
to the energy ttiqC^ 

Using Eq. (29.39) and changing the summation over the momenta 
of electron and neutrino to an integration [see (29.38)], we obtain 
an expression for the probability of ^ decay 

w = ^ 2^*^ = S (K + X - Knp)- (29.47) 

k, X 

Integrating this expression over all possible angles at which the 
electron and antineutrino emerge from the nucleus and also over 
the energy of the antineutrino, we obtain an equation for the energy 
distribution (E — cHK) of the electrons (the jS spectrum) 

Eo 

I w(E)dE, (29.48) 


where 


(^) = E‘-mr{E, - Ef. 


(29.49) 
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The energy-distribution curve of the j3 electrons, obtained in the 
Fermi theory, is plotted in Fig. 29.1.® From this curve it is clear 
that the energy of the emitted electrons lies between £‘min=='Wo^^ 
and = Ep 

According to the Pauli-Fermi theory, there is no violation of 
the law of conservation of energy, since the total energy of the 
emitted antineutrino and electron must always be equal to the total 
energy lost by the nucleus during ^ decay. ^ 


E. NONCONSERVATION OF PARITY IN THE DECAY 
OF PARTICLES 


One of the fundamental discoveries in the theory of weak interactions was the dis- 
covery of the nonconservation of parity by Lee and Yang ( 1956^ 'This phenomenon gives 
rise to a spaaal asymmetry in the spontaneous decay of elementary particles and, in 
particular, in nuclear 0 decay. 

The nonconservaaon of parity can be observed experimentally in the following two 
phenomena, 

1, The asymmetry of the angular distribution of electrons in the 8 decay of nuclei 
with an oriented spin (the number of 8 electrons emerging along the direction of nuclear 
spin does not equal the number of electrons emerging in the opposite directionX 

2. The existence of circular polarization (helicity) in the particles formed during 
decay (for example, electrons formed in 0 decay or p mesons formed in the decay of 
71 mesons), even in the case when the decaying system has zero spin. 

The phenomenon of nonconservation of parity in 0 decay or in the decay of a tt meson 
was explained with the help of the theory that assigned a definite circular polarization 
(helicity) to the neutrino. At one time it was thought that the neutrino (which is formed 
in positron 0 decay) and the antineutrino (which is formed in electron 0 decay) were 
identical particles (Majorana*s hypothesis^ It was suggested that this hypothesis could 
be tested in double 0 decay. 

If the neutrino and anuneutrino were identical, one would expect comparatively 
large values for the probability of double 0 decay without the emission of a neutrino (one 
neutron of the nucleus emits an electron and a neutrino, while another neutron emits 
an electron and absorbs this neutrino), that is, with the emission of only two electrons. 
If, however, double 0 decay consisted simply of two successive identical 0 decays with 
the emission of two electrons and two antineutrinos, the probability of decay should be 
much less. Experiment has confirmed the correctness of the second hypothesis and 
clearly demonstrated that the neutrino must be different from the antineutrino. 

We note, incidentally, that both particles are neutral and have a spin 1/2, Physicists, 
however, were able to establish a difference between the parucles from phenomena 
associated with nonconservation of parity. It has been demonstrated experimentally that 
the asymmetry observed during positron 0 decay of a nucleus with an oriented spin, 
when a neutrino is emitted together with a positron, is the reverse of the asymmetry 
that IS observed during electron 0 decay, in which an antineutrino is emitted together 
with an electron. It was assumed, therefore, that the neutrino differs from the anti- 
neutrino by the type of circular polarization. In order to explain the experimental data, 
it was necessary to postulate that a neutrino resembles a photon with left-hand circular 
polarization, while an antineutrino resembles a photon with right-hand polarization. The 


g 

It should be noted that the maximum of this curve is slightly shifted towards small 
energies. This asymmetry is due to the fact that the antineutrino mass is equal to zero, 
while the electron mass is different from zero If the niass of the antineutnno were equal 
to the electron mass, this curve (neglecting the Coulomb attraction of the electron and the 
nucleus) would be symmetrical, that is, the maximum would occur at the point E/2 

^For more details, see H Bethe and P Morrison, Elementary Nuclear Theory, New York: 
John Wiley & Sons, Inc , 1958 
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only difference is that the spin of the photon is 1 (in units of n), whereas the spin of 
the neutrino is 1/2, 



Neutrino Antineutrino 


Fig. 29 2 Helicity of the neu- 
trino and antineutrino The 
neutrino has a left-hand circular 
polarization and the antineutrino 
has a right-hand circular polari- 
zation 



Fig. 29.3 Helicity of a left- 
handed neutrino in right-hand 
and left-hand coordinate 
systems 


The circular (longitudinal) polarization of the neutrino is generally called the helicity. 

The neutrino has left-hand circular polarization^ or negative helicity. This means 
that if a left-handed screw rotates along the direction of polarization. It moves in the 
direction of the momentum. The antineutrino, however, has right-hand circular polari- 
zation, or posluve helicity (see Fig. 29.2). In order to conserve helicity when changing 
from one Lorentz frame of reference toanoAer,so that it can be adopted as a character- 
istic of the neutrino, it is necessary that the rest mass of the neutrino be exactly equal 
to zero. 10 

Several authors describe polarization with the aid of an axial vector, which is per- 
pendicular to the plane of rotation and has a different direction in the right-hand and 
left-hand coordinate systems (see Fig. 29.3), It should be noted, however, that, although 
the axial vector s and the polar vector p have different mutual orientations in the right- 
hand and left-hand coordinate systems, the helicity is nevertheless conserved; that is, 
the helicity of the neutrino is still negative, and only the method of description has 
changed, ^ ^ 

Starting with the polarization properties of the neutrino, we shall make an attempt 
to give a qualitative explanation of the nonconservation of parity during the spontaneous 
decay of particles. 

Let us consider, for example, the 3 decay of nuclei with oriented spin. We note that 
the spin of the nucleus (its longitudinal polarization) is more naturally described by a 
rotation, since the direction of the axl^ spin vector is arbitrary. The spatial asym- 
metry which should be observed in the phenomena characterized by the nonconservation 
of parity is associated with the fact that in electron 3 decay, right-handed antineutrinos 
are emitted upwards and downwards with spins oriented, respectively, parallel and 
antiparallel to the spin of the nucleus. The electrons will be formed predominantly with 
a helicity opposite to that of an antineutrino (that is, with a negative helicity). This pro- 
duces a spatial asymmetry due to which the number of electrons emitted in the direction 
of nuclear spin does not equal the number of electrons emitted in the opposite direction 
(see Fig, 29,4). In general, we obtain the following equation for the number of emitted 
electrons as a function of the angle 0 between the direcuon of electron momentum and the 
upward direcuon, which two direcuons form a right-handed system with the polarization 
of a nucleus: 

W == — « cos S), (29.50) 


where a is positive and is equal to approximately 0.4, 

the rest mass of a particle differs from zero, the parallel spin and momentum vectors 
may be directed at an angle after transition from one Lorentz frame of reference to another 

Since the direction of the (axial) spin vector « relative to the polar vector of the mo- 
mentum p is different in the right-hand and left-hand coordinate systems, Lee, Yang, Lan- 
dau {Zhumal Eksperimental'noy i Teoreticheskoy Fiziki, 32, 405 (1957)], and others assume, 
on the contrary, that in this case the negative helicity of a neutrino changes to a positive 
helicity. In other words, they assume that the method of geometric description of the parti- 
cle can change its internal properties (helicity). 
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Asymmetry was detected in the electron 0 decay of Co 60 nuclei with oriented spins 
and also in the angular distribution of electrons in the 0 decay of free, polarized neutrons 
(for which 0. 1). 

Since a neutrino with a negative helicity emerges during positron decay, the asym- 
metry pattern will be opposite to the one described above. The number of emitted posi- 
trons is related to the angle ^ by the equation 

«’,+(») = (1 + a cos 9); (29.51) 


that is, the positrons that are formed have mainly a positive helicity and are emitted 
primarily upwards. An asymmetry opposite to that of electron decay was observed ex- 
perimentally in positron 0 decay of Co^S nuclei with oriented spins. 



Electron Positron 

0 decay 0 decay 


Fig. 29.4 Schematic diagram of the /3 decay 
of nuclei with oriented spins The direction 
of rotation characterizes the direction of the 
spin of the particles, p is the particle 
momentum 

Longitudinal polarization was observed particularly clearly in the spontaneous decay 
of plons into muons and a neutrino. Let us choose a coordinate system in which the pion 
is at rest and consider the negative pion which decays into a negative muon and an anti- 
neutrino. Since the antineutrino has a positive helicity and the momenta of the muon and 
the antineutrino must be equal and opposite, we find that the muon also must have a 
positive helicity. Indeed, only in this case will the total spin of the muon-antlneutrino sys- 
tem be equal to the initial spin, that is, zero. In the decay of positive pions into a neutrino 
and positive muons, the muons, however, will obviously have a negative helicity (see 
Fig, 29,5X 



TT Meson 



Fig. 29.5. Decay of a pion at rest into a muon and a neutrino 
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The total longitudinal polarization of the created muons has also been confirmed experi- 
mentally. 

A more detailed discussion of the nonconservation of parity lies outside the scope 
of this book, and we must refer the reader to the special literature on this subject, 12 


1 2 

See the papers in T D Lee and C. N. Yang, Nobel Lectures, Physics^ New York: 
Elsevier Publishing Company, 1964 



Appendix A 

Hilbert Space and Transformation Theory 


There is an elaborate formal structure of quantum mechanics 
that is important for several reasons: the wave mechanics and 
the matrix mechanics can be unified in one coherent scheme. The 
conceptual structure has a deep intuitive appeal that gives the 
theory a sense of completeness and solidity. The physical content 
can be embedded in an extensive and rigorous mathematical 
framework. And, most Important for the validity of the physical 
theory, it is a powerful and flexible phenomenological tool that 
embraces a wide variety of empirical knowledge. The conceptual 
structure and statistical foundations of quantum mechanics are 
formulated most fully in the framework of ‘‘abstract Hilbert 
space. It is, however, not necessary to go into the mathematical 
technicalities to achieve an accurate physical grasp of the theory. 
For a complete treatment of the mathematical and statistical 
foundations one should consult the original literature.^ In this 
appendix we shall outline some definitions and notational apparatus 
around three broad topics: vector spaces, operators and the inner 
product. 

Vector Spaces 

An abstract (complex) vector space is a set of abstract ele- 
ments called vectors (or points in the space) which together with 
complex numbers obey the following axioms: 

(1) If 'Pa and 'Pfc are vectors, ^ 'P^ = 'P^ f 'Pq is also a vector. 

(2) 'Pa h (VP^ + 'Pc) - ('Pa f ^h) + 'Pc. 

(3) If X and y are complex numbers and 'P is a vector, then x'P 
is a vector and x(y'P) - (xy)'P. 

(4) (X + y)^ - x'P + y'P and x('l^a + ^h) - x'Pa + x'P;,. 

(5) There is a null vector 'Pnuii - 0 such that ^ f 'Pnuii = ^ and 
0^ ^ ^nuii for all 'P. 


'p.A.M. Dirac» The Principles of Quantum Mechanics, fourth edition, Clarendon Press, 
Oxford, 1958. W. Heisenberg, The Physical Principles of Quantum Theory, translated by 
C, Eckart and C. Hoyt, University of Chicago Press, Chicago, 1930. (Dover Publications, 
New York, 1949). J. von Neumann, Mathematical Foundations of Quantum Mechanics, 
translated by R. T. Beyer, Princeton University Press, Princeton, 1955. 
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The structure of a vector space is appropriate for introducing 
the notion of linear independence: the set of vectors '^ 2 * • • • i 
H'jfe are linearly independent if the only set of complex numbers z\, 
22* • — 1 satisfying the relation 214^1 + 22^1^2 + • • • + zk^k = 'Pnuii 
is the trivial set 21 = 22 -•••== 2* = 0 . If it is possible to specify 
n linearly Independent vectors but notn + 1 , the space is said to be 
/z-dimensional. If there is no limit to the number of linearly inde- 
pendent vectors that may be specified, then the space is ‘‘infinite 
dimensional/’ In an n-dimensional space a set of n linearly inde- 
pendent vectors 'Pi, 'P2, . . . , 'P^ forms a basis that spans the space 
in the sense that any element 'Pa of the space can be written as a 
linear combination of the basis vectors - 1 , 2 , . . . , tz, 

n 

fa - E 0 ,f, 

1 

The coefficients ai, i 1 , 2 , . . . , « are complex numbers that char- 
acterize the vector 'P^ in the basis 'Pi, 'P2, . . . , 'P^. If and bi, 
/ = 1 , 2 , . . . , are the components of the vectors 'P^ - and 

'Pfe - then Cl ^ ai + bi are the components of the vector 'Pc - 

= 'Pa f and zai are the components of the vector 2'Pa, 
where 2 is a complex number. 

The geometric concept which is the fe-dimensional generaliza- 
tion of the notion of “lines and planes passing through the origin,” 
can be developed with the following definition: 

A subset SW of a vector space is called a linear manifold if it 
contains all the linear combinations yi'Pi i- y2'P2 + • • • + yk^k along 
with any kik ^ 1 , 2 , . . . ) of its elements 'Pi, 'P2, . . . , 'Pfe. Alterna- 
tively, if a is an arbitrary set of vectors containing the distinct 
vectors Wi, ^2^ • • • 1 fhe set of all linear combinations 

3 ci'Pi + x 2^2 + •••♦- xk^k (with arbitrary complex numbers x\, X2, 
. . . , xfe) is a linear manifold SH. SD? is called the “linear manifold 
spanned by a.” If the vectors 'Pi, 'P2, . . . , 'Pj^ are linearly inde- 
pendent, the manifold is fe-dimensional. 

The utility of the concept of a linear manifold is that it is the 
domain of definition of a linear operator. In general an operator 
is a mapping or correspondence from a domain consisting of cer- 
tain points in the vector space into a range consisting of certain 
other points, denoted symbolically by 'P ^ R(^). In quantum mech- 
anics we are concerned with linear operators for which by defini- 
tion 


R(^ + 4>) = Rm + Ri^) and Ria^) = aR('P) . 

That is, the domain of definition of the operator R is a linear 
manifold. Note that a linear operator need not be defined for all 
vectors in the space. For example, if the space is the “function 
space” consisting of square integrable functions on the interval 
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_«, < X < oo, then there are vectors (functions) for which xi/r or 
di/f/dx are not square integrable. 

There is a “projection property*’ associated with a linear 
manifold: any vector 'P may be resolved into the sum of two vec- 
tors 'P ^ ¥|| + 'Pj, where ^\\ is in the manifold 9H (i.e., there are 
complex numbers ci, C 2 , . . . , ck such that 'P|| - ci'Fi + 02^2 ^ 

where '^i, 'l^ 2 » • • • » span the SK) and^j is entirely outside 
2H. As a notational device we may define the linear operator 
as follows : 'P 11 = Pmj’P for all 'V. The domain of definition of Pgjj is 
the entire space, while the range consists of SDl and the null vector. 
If 4*1 lies outside SR, Pg)i^i = 'Pnuii- If 'P 2 Hes wholy within SR, Pg>j’P2 = 
S' 2 . If S' is an arbitrary vector, lies entirely within W and 
Pgjj^S' - P(((i(Pg)i***) ^ I’art’*'* Since this relation holds for all S', we may 
assert the operator equation 


= % 

An (Hermitian) operator satisfying this relation is c ailed idempotent 
or a projection operator. It has eigenvalues 0 and 1. 

Operators 

Let 8 ( stand for a physical observable which upon sharp meas- 
urement yields any one of a sequence of values af, 82 * • • • which 
are characteristic of the observable. The set of values Hi, 82 . • • . 
is called the spectrum of 8 . An algebra of observables is set up 
as follows; let e be a complex number, then zft is an observable 
with characteristic values 281 , 282 , • • • and if a measurement of 8 
yields 8 *, the measurement of 28 yieldsz 8 ft. The observable 8 ^ = 
88 yields the measured value ( 8 *)^ when 8 yields 8 *. If 8 and 8 
are simultaneously measurable then the sum 8 t 8 Is defined to be 
the observable that yields 8 * f 8; when 8 yields 8 * and 8 yields 
8 /. Finally, the product 88 may be defined by the artifice 

88 - * (8 + 8 )^ - -(8 - 8 )^ 

4 4 

if SI and S& are simultaneously measurable. These definitions pro- 
vide a “physical” construction of polynomials of several simul- 
taneously measurable observables SI, 9, etc. 

In quantum mechanics the mathematical idealization of an ob- 
servable is a linear operator on a Hilbert space. Let us first out- 
line the purely algebraic aspects of operators which can be defined 
without reference to the space on which they act. An abstract op- 
erator algebra is a collection of elements called operators which 
together with complex numbers are endowed with the structure of 
a vector space (in a technical sense, not to be confused with the 
Hilbert space) and in addition an operator product. Explicitly, 
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( 1 ) If M and N are operators, then M + N = N + Mis also, 

( 2 ) M + (N f 0 ) - (M + /V) + 0 . 

N ( 3 ) Multiplication by a complex number is allowed; i.e., xM is 
an operator and x{yM) - (xy)M, 

( 4 ) (x + y)M = xM ± yM and x(M ± N) ^ xM ± x/V. 

( 5 ) There is a null operator 0 such that 0 + IVJ - M and OM - 0 
for all M. 

Furthermore, there is a composition law (the ‘‘operator prod- 
uct'O which assigns to each ordered pair of operators (M,N) another 
operator which is denoted by MN. The composition law obeys the 
following axioms (we now denote operators by Ti, T2, T3): 

(a) T1T2 is a operator if Ti andT2are; i.e., the operator algebra 
is closed with respect to the product, 

(b) Ti{T2Ts) - (T\T2)Ts (associative rule), 

(c) Ti(aT2 + bT^) = aTiT2 bTiT'^ and (aT2 i bT^)Ti - aT2T\ + 
bTsTu 

(d) There is a unit operator Tumt - 1 such that IT - TI - T for 
all operators T. 

Once a concrete identification of the operators is made and a 
labeling scheme adapted a “multiplication table’' can be set up. 
The multiplication table can be formulated with the help of the 
notion of linearly independent operators. Assuming there are only 
a finite number n of linearly independent operators Ti, T2, . . . , Tn, 
we may express any operator as a linear combination of these. In 
particular, as the operator algebra is closed, 

T,Tj = , 

where the complex coefficients are the structure constants of 
the algebra. 

The algebraic properties defined by operator axioms (l)-( 5 )and 
(a)-(d) can be realized concretely in terms of linear transforma- 
tions on a vector space. The operator Ti is identified with the 
transformation which maps ^ into Ti'F, The transformation is 
linear 


Tiia^i + bW2) aTiWi + 6Ti'P2 • 


Sum and difference Ti + T2 is identified with (Ti ± T2)'P = Ti'P ± T2'P, 
while zTi corresponds to the transformation izTx)^f - ^(Ti'P), these 
relations being valid for all m for which Ti'P and T2'P are defined. 
If the range of T2 includes the domain of Ti, the operator product 
T1T2 is defined by (TiT2)'P Ti(T2'P) for all V in the domain of T2. 

A transformation is nonsingular if it transforms distinct vec- 
tors into distinct vectors; that is, if / 'P2, then T'Pi -/ T'P2. An 
alternative definition: T is nonsingular if to each vector 4 >i - T^i 
there is a unique solution Wi. In particular, ifT'Pi - WnuW* then 
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^ 'Pnuii. The unique solution 'Pi may be expressed in terms of 
Oi by an operator S defined by the relation 'Pi = S^i. It then fol- 
lows that 'Pi - ST'Pi. If this latter equation holds for all 'Pi, we may 
assert the operator equation ST = 1; S is called the left inverse 
of T. If TS - 1, then S is the right inverse of T. If ST - 1 = TS, 
then S is the inverse of T and is given the notation T“^ also, T = S"^. 
If A and B have inverses, then the inverse of AB is (AB)"^ 
even if A and B do not commute. Finally, if the vectors 'Pi, 'P 2 , 

. . . , W/j are linearly independent and span a manifold 3B, and if V 
is a nonsingular transformation, thenV'Pi, V'P 2 , . . . , V'P/j are linearly 
independent and span the transformed manifold V2W. 

Inner Product 

An Hermitian inner product is a mapping that assigns to each 
ordered pair of vectors W, a complex number designated by ('P, 0), 
The number ('P,<I)) is called the inner product of 'P and<I). An inner 
product has the following properties: 

('P, <b) = (O, 'P)* (Hermitian symmetry) 

('P,a$ + 60) = a<'P,0) + 6('P,0) 

('P,'P) 1 0 and(^,^) = 0 implies 'P - 'Pnuii. 

The norm or ^Hength^’ of a vector 'P is 1 | W|| = V('P, and the dis- 
tance between 'P and O is \\^ - <^|1, If (W, <I>) = 0 the vectors are 
orthogonal. A set of vectors 'Pi,'P 2 , ... forms an orthonormal 
basis if ('Pj, 'Pfe) - Sife , /,/e - 1,2, . . . • Some important theorems are 
the Schwartz inequality |(V,<I>)| <||'P|1 • i|^|| and the triangle in- 
equality Il'PIl + 110 II ^ n't' + 0||, For each linear manifold there is 
an orthonormal basis which spans DW, 

The projection operator of a manifold SW spanned by the 
vectors 'Pi,^ 2 » • • • » be written in terms of the inner prod- 

uct as 

PsDi'P = E 'P) 

1 = 1 

The projection operator onto the ray or one dimensional manifold 
defined by 'P, is P^^ ^ 'P,('P|,'P). If 'Pi,'P 2 , ... is an orthonormal set 
the projection operators are related to the vectors by the equations 
The orthonormality property is PiPk - ^ik^k and the 
completeness relation is SP, = 1, the unit operator, 

I 

Let *Pi, 4*2, .. . be the eigenfunctions of an operator M corre- 
sponding to the eigenvalues Ml.Ma, .... For simplicity, let us 
assume the eigenvalues are distinct. Mi / M^for/ /?;then {1',,'l'fe) - 0 
and the vectors may be normalized so they form an orthonormal 
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basis. The relation of the projection operators Pj of this ortho- 
normal basis to the operator M is the following: MPj = PjM ^ ^'jPj* 

The spectral resolution of M is M = Let F(A)be an arbi- 

;= 1 

trary function of a real or complex variable A. The operator func- 
tion F(M) is defined by 

00 

F(M) = X) 

i= 1 

In the mathematical realization of an observable SI in terms of 
an operator A, the eigenvalues A 1,^2 of the operator A are identi- 
fied with the characteristic values Sli, !t 2 of the observable. The 
construction of polynomials of observables is identified with the 
construction of functions of an operator given just above. 

There is a particularly important notation, the Dirac notation, 
in which a ket ] > is placed around the symbol ^ to denote a vector. 

1 'P > is called a ket vector. The inner product between vectors 'Pq 
and is denoted by <^a\^b>* The expression < 'Pa I is regarded 
as a vector in its own right; it is called a bra vector or simply a 
bra. The primary distinction between bra and ket vectors is that 
if the abstract vector 'P transforms as 'P L/'P under a unitary 
transformation U, the bra |^>^i/|'P>=|(7'p>, while the ket <'P| 
transforms (contragrediently) as <'P| ^ <^1 = <'P|(7^ = <'P|U"^. 
Under these transformation properties the inner product <'P|^> ^ 
remains invariant. Matrix elements of an operator M 
are written as <'P|Ml^> ('P,M<^). In the Dirac notation, the de- 

fining equation for a projection operator reads 


P9hI'P> - El'P,><'P,|'P> 

t- 1 

for an arbitrary ket | 'P >. Since 1 'P > is arbitrary it is a permis- 
sible notatlonal device to omit writing the | V >. Then the expres- 
sion for a projection operator in the new notation is 

Pm = E I'J’.x'P.! 

where the sum runs over the vectors 'Pi,'P 2 , . . . , 'P^ which span 
the manifold 9W. An arbitrary operator A may be expressed in 
terms of its matrix elements A^k = in a complete ortho- 
normal basis • by the notation 

-4 - E |<D.><$a1 

i.k 

There is a notatlonal simplification possible when working with 
a particular basis . in that the ublqulous <I> is a redundancy 
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in the notation: the ket vector 1 0 , > can be abbreviated to read | f >. 
The Hilbert space for a specific physical problem is built on the 
cononical coordinates Qi,Q 2 ^ • • * ^ system of fe-degrees of 

freedom. The abstract vectors have no numerical significance, 
rather the q^s serve to label what will be considered a complete 
orthonormal set, ^qi,q 2 ^ • • • » The ket will be denoted by Iq^, 
^ 2 ’ • • • » >• (Improper) 5-function normalization scheme 

the orthonormality relation is 


^ 2 ’ * • * * * • * * ^ ~ qi^^^q2 “ ^ 2 ^ * * * ~ 

An arbitrary vector 'Pa may be expanded in the ^q basis: 

1 'Pa > = y*. • •J'llfaiqi, 92’ •••»«*> I ^1’ 92 q^xiq^dq^. . . dq^ 


where the expansion coefficient ^fj^qi,q 2 ^ ...» is the probability 
amplitude for finding particle 1 at position etc.; i.e., 

. . . ,q^) is the Schrbdinger wave function in the coordinate repre- 
sentation. Taking the inner product of | 'Pa > with the bra << 7 ^ . . . < 7 ^ 1 
and using the orthonormality relation, one may verify that 


= <^V^2 

In quantum mechanics a particle may have a discrete internal 
degree of freedom, spin, for example, where the discrete variable 
o takes on either of two values, t for spin up or ; for spin down. 
For a particle with spin and one spatial degree of freedom x, an 
arbitrary state 'Pa may be expanded as 

l'Pa> = Ey* 


where $<y(x) is the probability amplitude for finding the particle at 
X with spin a (see Chapter 16). 

Treating now a particle on the interval - «. < x < «> with no spin, 
we write the inner product of two vectors Vq and 'P* by first ex- 
panding tf, in terms of the set W* 





then taking the inner product using the distributive rule under the 
integral. The result is 


^00 

(^a.^6) = / ('Va,^x)</fb(x)dx 



\//a(x)* \&b(x)dx, 
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the ordinary tunction space inner product. The projection operator 
Pab onto the interval a < x < b is 


Pab'V 


/: 




or in the Dirac notation 


^ab 



^x>dx<^x\ . 


An arbitrary operator M is expressed in terms of its jc- space 
matrix elements <x|M| 3 ' > - by the formula 


M = 



<x\M\y> l^xX^yldxdy 


the action of an operator M ona (ket) state vector | ^ > is written as 


M\m> 



Wjc ^ I M 1 3 / >< y I 'P > dxdy y 


where it is now appropriate to call ^x|jyf|>'> the kernel of the 
integral operator M. The average value M, or what is the same 
thing, the expectation value of M in the state <t>, is 





<.^|x><x|Mly><y|<I)>(ixd>' 


An operator V is called local if its matrix elements have the 
special form <x|M|y > - ^(x - y)V(x), where dix - y) is the Dirac 
S"function and Vix) is an ordinary function. The expectation value 
of a local operator is 


V - 



< 0 1 X > V (x) < X I ^ dx , 


The kinetic energy operator T has the matrix element < x [ T 1 y > = 

^x8ix - y), which is also regarded as a local operator. With 
2m 

the help of Integra tion-by- parts the expectation value of T can be 
written 


2 

f - / cfiix)* yx^Ax)dx y 

2m 


where 0(x) = <x| 0 



Appendix B 

The Statistical Assertions of Quantum 
Mechanics 


We consider a cononical system of k degrees of freedom, em- 
ploying the coordinates to specify its configuration and 

the cononically conjugate momenta to specify Its condition of 
motion. In the wave mechanical mode of description, in the co- 
ordinate representation, everything that can be said about the 
‘‘state of the system at one time’^ (its configuration, condition of 
motion and the values of all Its physical quantities) must be de- 
rived from the wavefunction The functions admitted 

as wave functions are those that are square integrable (normaliz- 
able, 11 011 finite) and furthermore normalized to unity, 


^00 ^ 0 

•'-00 •'-00 


’ 9/,)! • dq^ = 1 


although in applications to continuous spectra this requirement 
may be relaxed. There are three primary statistical assertions: 

(1) the probability of finding the system within a volume V of con- 
figuration space is 

J"- ■ -J I • • • • \^dq^ ■ . . dqf^, 

V 

(2) if the energy of the system has the operator H with eigenvalues 

E\,E 2 , . . . and eigenfunctions then the probability that 

the system has the energy value Ej in the state <f> is 



lA/?! • ■ • 9*)* . Qk^dq^ • • - dq^ , 


and (3) the average value of a physical observable St to which the 
operator A corresponds is 


^00 

•'-00 •'-00 


<l>(qx, 


9*) A 9i 




^9i 




9i<9i. • • • , 9j|,)d9i • • • dq,^ 
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In the quantum state The interrelation of these three assertions 
will now be examined. 

In order to present assertion (1) in a general form, we intro- 
duce the projection operators Pj(lj)y ; = 1, • • • , associated with 
the intervalue Ij defined by q'j < qj < q'J. The projection operator is 
defined by 


P,(/,)<D(9i, ... ,9^) 


. ,q^) for q'j < qj < qj 
0 otherwise . 


The projection operator associated with the (rectangular) volume 
V is Pi(/i)P2(/2) ‘ and the integral specifying the prob- 

ability of finding the particle described by 0 in V may be reduced 
to the expression 


|lPl(/l)P2(/2) - • - 

For assertions (2), let Pe„ denote the projection operator associ- 
ated with the eigenvalue of P, that is, 

Pe^ 0m ^ 0n for m ^ n, and = 0 for m / n. 

The probability of finding the value upon a sharp measurement 
of the energy of a system in the state can then be put in the form 

The probability of finding the system within an energy interval 
E' < E <: E' is 


where P(/b) is the projection operator associated with the interval 

P(/e) - E • 


Assertions (1) and (2) may now be unified in what is called ^‘the 
most general probability assertion possible’’: 

Statistical Postulate: The probability that in the state 0 
the physical quantities with the operators . ,A^take 

on values from the^ respective intervals /i,/ 2 » • • • is 

w = llPl(/l)P2(/2) - 

where Pi,P 2 , . . • Pm are the projection operators belonging 
to the operators Ai ... Am. 
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Assertion (1) has m = fe, with A\ - A 2 -- * • • » == Qk\ while 

assertion (2) has m - 1, with Ai - H. In order to insure that this 
postulate is a coherent statistical statement the following prop- 
erties must be varified: 

(a) Since the order of the operators is arbitrary in formulating 
the statistical question, the order of the projection operators in 
the statistical assertion must be immaterial. This implies that 
the projection operators must commute for arbitrary intervals 

, and this in turn implies that the operators 4 1 ,^ 2 * 

. . , ,Am must commute among themselves. 

(b) Vacuous propositions may be inserted at will, vis. if the 
interval l' is contained within the interval PiDPU') - Pil'), or 
the projection operator for obtaining any value whatever is the 
identity. 

(c) If the interval Ij lies outside the spectrum of for some j, 
the corresponding projection operator is the null operator and the 
probability is zero. 

(d) Probabilities are additive; i.e., if an interval / is resolved 
into two disjoint subintervals 1^1+ r, with operators P(l') and 
Pil") corresponding to I' and r, respectively, then 

1|P(/)<1>||^ - ||P(/')4)|l^ + ||P(/'')<I>|1^ . 

(e) The total probability W ranges over the values 0 < < 1 

for normalized 

Assertion (3) at the beginning of this appendix, can be expressed 
in terms of the inner product. If we introduce the notation Exp 18, <I>! 
to stand for the statement “the expectation value of the physical 
observable 8 in the state assertion (3) may be stated 

Expl8,^>! - A - (0,A<^), 

where A is the operator corresponding to 8. Let PMX)be any func- 
tion of the real parameter A. There is a theorem to the effect that 
if the observable 8 has the operator A, then the observable F(8) 
has the operator F(A), Assertion (3) may then be generalized to 
read 


Exp{P(Sf),<D| = F(A) -- (<I>,P(A)$) (3') 

Assertion (3^) can be derived from the statistical postulate as 
follows: subdivide the interval (-(*,<») into a sequence of subinter- 
vals In = lA„,A„+il, where -00 < A-„ < • • • < ^ A-i < Aq < Ai < • • • > A^ < 
+ 00 . Let AA be s ome number in the interval \j < \j < A^+i. The 
average value F(A) in this mesh is 

n 

F(A) = X) P<A;)(<1>,P(/;)«>) 
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If we introduce the monotonic increasing projection operation E(A) 
defined by 


E(X) - X) 

Xj<X 

(conversely, Pilj) - E(\j+i) - E{\j)) and let the w ~ and the mesh 
size vanish, the sum approaches the Stieltjes integral 


FiA) ~ 



F(A)(^,dE(A)<D) , 


which by definition of a function of an operator is So 

far we have indicated that assertions (3) and (3^) follow from the 
Statistical Postulate. What is remarkable is that the converse is 
also true: the Statistical Postulate follows from assertion (3), The 
proof of this statement is obtained by a technical application of the 
following theorem: Let Ai,A 2 , . . . be a set of mutually com- 
muting operators; there exists an operator /? and functions Fi (A), 
F 2 (A), . . . ,F^(A) such that Ai - Fi(E),A 2 - Fo(F), . . . ,Fm - F^(/?). 

In the preceding discussion we have assumed that the state is 
a pure state, that is, it is described by single vector fl>. In general, 
in the physical preparation of a state some of the variables are 
left uncontrolled and the state is not completely specified. This 
situation can be formulated in terms of a classical probability 
distribution that is superimposed on top of the quantum mechanical 
uncertainty. Let us suppose that the system is in one of the states 
= 1,2, . . . , but we don't know precisely which one. Let Wa be 
the classical probability that the system is in the state The 
basic statistical postulate can now be reformulated as 


w Y, 'Vail Pl(/l)P2(/2) • • • Pm(/m)'Pall^ 

a 

and assertion (3) can be generalized to read 


Expl 

a 

The states 'I'a are an arbitrary set of states; they need not be 
mutually orthogonal. For the IVa's it is required that 0 and 
- 1 . 

The statistical assertions can also be formulated in terms of 
the trace. The trace of an operator A is 


trA ^ X! <'•'<. A*?,) , 

t 

where the sum runs over a complete orthonormal basis • • • • 

The trace is independent of which basis is used to define it. The 
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statistical assertions are 

W - trpPi(/i)P2(/2)- • • Pm(im) 


and 


Expl = tvpA 

where p is the density matrix (operator) 

p = EW'aPa - El'Pa>»V„<'PJ 

a a 


The density matrix is useful for making a general statement about 
the time evolution of the system* Expectation values change with 
time according to the general rule 

- ExplSr ^ trlpA + pA\ . 

dt 

In the Schrddinger picture the burden of change is put on p; namely, 
A - 0 and +ih p ^ lip ~ pll, where H is the Hamiltonian. In the 
Heisenberg picture p = 0 and - iH A - HA - AH, (see Problem 8.4). 
The Hamiltonian controls the evolution of an isolated system. When 
the system is subjected to a measurement of one of its observ- 
ables R and the eigenvalue R' obtained the uncontrollable disturb- 
ance of the measuring process forces the system discontinuously 
into the state described by the eigenvector corresponding to 
the eigenvalue R\ If the measurement of R is repeated (before the 
evolution generated by the Hamiltonian moves the system into 
another state) the system is already in an eigenstate of Pand 
thus yields the value R' with certainty. 




Problems 


Chapter 1 

1.1 Express the space and time dependence of E, H and A in 
terms of amplitudes and phase angles for a plane wave 
moving in the positive x direction. What are the conditions 
for plane or circularly polarized light? What is the initial 
data at t = 0 for solution of Maxwell’s equations or the wave 
equation for A? Express the solution in terms of a power 
series in time. Express the solution for t < 0 in terms of a 
solution for f > 0 with appropriate initial data. 

1.2 Determine the magnitudes of E and H for unpolarized sun- 
light characterized by the solar constant (Poyntlng vector) 
S* = 2 cal'icm ^min"*. What is the ratio of electric to mag- 
netic force on an electron moving with the speed of an elec- 
tron in the first Bohr orbit? 

1.3 Compare the wave and photondescrlptions of normal reflection 
of sunlight from a mirror. What is the energy density, pres- 
sure against the mirror and the density of photons? Take 
X = 5500A. What is the electric field intensity of a photon 
absorbed in a 1 cc detector? 

1.4 Find an expression for the angular momentum density of an 
electromagnetic field. Show that a photon carries an ai^ular 
momentum of magnitude t. Calculate the torque on a quarter 
wave plate by a normally Incident left-clrcularly (right- 
screw) polarized light. 

1.5 Determine the spectral density of radiation under the as- 
sumption that the oscillator representing the behavior of the 
walls for angular frequency w can assume the energy value 
E = 0 or any one of a continuum of values B > Show that 
the gap in the energy spectrum is related to the behavior of 

in the quantum region ftT « ta. 

Answer. The partition function Z = = 1 + e‘“"/a(u. 


Chapter 2 

2.1 Consider the scattering of 4 MeV alpha particles on gold 
atoms (Z = 79). Show that the distance of closest approach is 

^min “ (ZZeo2/2EQ) (l f csc~6j) and evaluate it for scattering 
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2.2 


2.3 


2.4 


2.5 




2.8 


2.9 


angles 5®, 20® and 80°. At what angles will the scattering 
deviate from the Rutherford formula? 

Consider the scattering of alpha particles from gold foil. A 
radium source yields 4.8 MeV a-particles at a rate of 3.7 x 
10^^ particles per second per gram of Ra. The target has 
5.9 X 10^^ atoms-cm a thickness of 4 x cm, and a 
cross sectional area of 8 mm^. The target is situated 1 cm 
from the source. Consider a detecting screen of 2 mm2 area 
5 cm from the target. What amount of radium is required for 
a counting rate of 30 per minute at scattering angles 10° and 
120°? 

Derive (2.46) for a general one- dimensional potential by 
evaluating d\/dE directly. Reduce the validity of the Bohr 
quantization rule for large n by equating the classical ex- 
pression for the frequency of the expression in terms of the 
energy level spacing. (This application of the correspond- 
ence principle shows that (2.46) and (2.47) give a consistent 
classical limit if the same constant t appears in both equa- 
tions.) 

Examine the classical and quantum descriptions of the emis- 
sion of light from a harmonic oscillator. On the basis of the 
correspondence principle, argue that the ft appearing in the 
Bohr quantization rule is the same Planck's constant char- 
acterizing the corpuscular nature of light. Deduce the energy 
level spacing and infer the levels between which electric 
dipole transitions are possible (selection rules). 

Use the correspondence principle to infer the energy levels 
of a rigid rotator consisting of a mass m held at distance b 
from a fixed axis of rotation. Note that the angular momentum 
is independent of the mechanical parameters. 

Show that for circular motion in a general centrally sym- 
metric field, the angular momentum is quantized in steps 
of 

Find the energy levels of circular orbits in the Bohr plane- 
tary model of the hydrogen atom using the correspondence 
principle directly. Note that as usual n is left undetermined 
up to an additive constant of integration. Infer the selection 
rules for electric dipole transitions between circular orbits 
and find the accompanying angular momentum change. 
Estimate the lifetime of a stationary Bohr orbit by computing 
the time required for the corresponding classical motion to 
radiate away an amount of energy equal to the quantum energy 
level spacing. Compute the cumulative time for several suc- 
cessive transitions between circular orbits and compare it 
with the time taken to radiate the same amount of energy 
classically. 

Use the Bohr quantization conditions to determine the energy 
levels of an isotropic three-dimensional harmonic oscillator. 
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Let the potential energy be | and use polar coordinates 

in the plane of the classical motion. 

2.10 Find the energy levels of the relativistic linear harmonic 
oscillator according to the Bohr-Somerfeld quantization rule. 
Evaluate the lowest order correction to the non- relativistic 
energies. The potential may be introduced via the scalar 

potential - 2 replacing the mass invariant by 

Mqc^ + ^ kx^. 

Chapter 3 

3.1 Calculate the deBroglie wave lengths of the following par- 
ticles each with a kinetic energy 500 keV: photon, electron, 
proton, and alpha particles. Also of thermal oxygen atoms at 
300°K. 

3.2 What is the dispersion law co - (oik) if the group velocity is 
inversely proportional to the phase velocity? How is this 
case realized physically? 

3.3 Compare the reflection of a particle and a wave from a 
moving surface. Show that AE/Ao - Ap/A/e, where AE(Ap) is 
the change in the energy (momentum) of the particle and 
AG>(A/d is the change in frequency (wave number) of the wave. 
Show this independently of the dispersion law of the wave and 
the energy- momentum relation of the particle. How can this 
result be generalized and what is its significance? 

3.4 Calculate the deflection of a charged particle by a thin slab of 
magnetic field, expressing the change of momentum in terms 
of the vector potential. How is the difflection explained in 
terms of the wave picture*^ What is the relation between mo- 
mentum and wave vector in the presence of a magnetic field? 

Answer, = 0 across the slab and p + Ae/c -- 1ik, 

3.5 Consider a particle bouncing back and forth in a rigid box of 
linear dimension L. What is the minimum measurable ki- 
netic energy of the particle? How much energy is required 
to constrain an electron to remain within a volume of nu- 
clear size? 

3.6 Determine the maximum time a free particle will remain 
within a volume of radius R by considering the limitations on 
the specification of the initial data in the classical descrip- 
tion of motion. 

3.7 Find an uncertainty relation connecting angular momentum 
and angular orientation. Consider a rigid rotator. What is 
the minimum uncertainty in L? 

3.8 One billiard ball bounces on another with a center- to-center 
height of ten times the radius. What is the optimal horizontal 
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localization to maximize the number of bounces? What is the 
maximum number of boimces? 

Answer, n ~ 50. 

3.9 What is the optimal localization of an ideal pendulum to 
maximize the time it will remain balanced in an inverted 
position and what is the maximum time? 

Answer. About six times the period for small oscillations. 


Chapter 4 

4.1 Under what circumstances can a narrow potential be approx- 
imated by a 5-function? What is the effect of a 5-function 
potential on the continuity property of a solution of Schrod- 
Inger’s equation? 

4.2 Show that the potential Vix) = ~gd(x^ - a^) has a discrete 
anitsymmetric eigenfunction for g > Ti^/m and not one for 
g < ^Vm. 

4.3 Consider the potential V(x) = oo for |jc| > L and Vix) ^ ~Vq5(x) 
for \x\ < L. Show that for Vo > 0 there is one eigenvalue E < 0 
and that it remains discrete as L approaches infinity. 

4.4 Find a complete set of orthonormal eigenfunctions for the 
potential V = ~1 Vq 15(x) on the interval < x < «>. Check the 
completeness relation (4.67). 

4.5 (Sturm- Liouvllle theorem) Let i/zi and t/zg be solutions with 
energies and E 2 of two Schrbdinger equations with po- 
tentials Vi and V 2 , respectively. Show that 

J ^ [(gg _ V 2 ) - (El - 

for any interval a <x <b. Show that if and 1 A 2 sire solu- 
tions for the same potential and if E 2 > Ei, then ^2 at 
least one node between each pair of consecutive nodes of 
Over the entire interval, then, v ^2 least one more node 

than (The eigenfunctions are ordered in energy according 
to the number of nodes. A set of eigenfunctions is complete 
if there is one for each integral number of nodes.) 

4.6 (Comparison potential for discrete states) Consider the in- 
terval 0 < X < «. with the boundary condition U(0) = 0. Find 
the potential for which U = x" Is a zero energy solution of the 
Schrodinger equation. Show that the potential obtained has no 

discrete eigenfunctions for n real. If w is complex, n = i + 

A 

ij8, show that the potential has arbitrarily many bound (i.e., 
discrete) states. The solution may be taken to be real U = 
Vx cos /3 Inx. 
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4.7 Let Vo be the comparison potential obtained in problem 4.6. 
Show the following: 

(a) If V > Vo for X > xo, there is no discrete eigenfunctions 
of V with a node located beyond xo. 

(b) If V < Vo for X > xo, there is a discrete eigenfunction 
with arbitrarily many nodes beyond xq. There are in- 
finitely many discrete states of V. 

(c) If V > Vo for 0 < X < xo, there Is no solution for the po- 
tential V with E < 0 that has a node in the interval 0 < x < xq. 

(d) If V < Vo for X < xo (i.e., if V - more rapidly than Vq), 
then there are eigenfunctions with arbitrarily many nodes 
in the interval. 

4.8 Scale the function in problem 4.6 to find some eigenfunctions 
of the potential V - for x < a and x > 6, V - -Vo(l/x^ - 
l/b^)aH'^/{b^ - a^) for 0 < a < x < 6. Is there a restriction 
among the parameters Vq, a. and b for this method to work? 
Can one find a complete set this way? 


Chapter 5 

5.1 Let a particle be prepared in the state ^(x,0) = V2a0(xo - x) 
exp{/px f a(x - xo)l at f = 0, where a > 0, xo < 0 and Oix) = 

|(1 + signx). Assume that no forces act on the particle for 

t > 0. Expand iffix.O) in terms of energy- momentum eigen- 
functions and show that for t > 0, 

ih{x t) “ exp(/px - dL 

^ i2ap + iy) Jo 1 ~ izL2 ’ 

_2 

where p = 1it/2m, y ^ x - xq - 2pp, and e = (1 + iy/2afi) . In- 
terpret i/r for t -» oo. 

5.2 Reconsider the initial wave function specified in problem 5.1, 
Assume the particle is in a potential field V = -|Vo|S(x). 
Expand tfrix^O) in terms of the eigenfunctions of problem 4.4 
and follow the motion in the dispersion-free approximation 
(3.20). Compute the probability of the particle being reflected 
by the potential. What is the probability of finding the par- 
ticle in the ground state? 

5.3 The initial value problem of the time- dependent Schrodinger 
equation can be solved in the form of an Integral operator 

with the Green’s fimction kernel Gt(x,x') = ^ ^^(x')*e*'“"‘ 

n 

where is a complete set of orthonormal energy eigen- 
functions. 

(a) Find the differential equation and boundaiy conditions for 
Gt in the case of a general one-dimensional potential. 
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(b) Show that the force- free motion of a non- relativistic par- 
ticle in an s-dimensional Euclidian space is given by 
Gtiry) - {m/2ni7it)^^ expUmir - r')^/21it\^ 

(cf. Problem 3.3 In the text. How can Gt serve as an approx- 
imation to the S-function?) 

5.4 Find the angular dependence and total current of charged 
particles flowing from a source at the origin. The wave 
function far from the source is cosOe^^Vr in spherical co- 
ordinates. 

5.5 A particle is in a uniform gravitational field in a region 
bounded from below by a perfectly reflecting surface. Adapt 
the WKB method to the case of an impenitrable wall and show 
that the energy levels go asin < 3/4)^*^ for large a?. Show that 

- En-\ for large n, where oy is the classical angular 

frequency. 


Chapter 6 

6.1 Find the probability for a particle of charge ^Z'eQ and speed 

V to penetrate through the Coulomb barrier to a nucleus of 

charge (Gamow factor) 

6.2 Derive (6.35), using integration by parts to pick out terms of 
order in 

6.3 Find the pressure exerted by the walls to contain a particle 

in a very deep potential well of volume V at absolute zero 

temperatures. 

6.4 Find a relation between the volume and pressure of an elec- 
tron gas at 0®K. 

6.5 Show that (6.66) holds for an arbitrary periodic potential 
(Block’s theorem). 


Chapter 7 

7.1 Express \/r as an integral operator in the momentum repre- 
sentation. _____ 

7.2 Find the kernels for the expression of and Vp^ v mlc^ as 
integral operators in the coordinate representation. 

7.3 Evaluate the commutator PjcF(r) - Vir)px, 

7.4 Let E(r,p) be a polynomial in the components of r and p. Show 
that 

.. L (Kx - xR) 
dPx t 

and indicate explicitly the meaning of the partial derivative. 

7.5 (a) Find an expression for the operator (A - in powers 

of where A and B are non-commuting operators. 
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(b) Let A and B be N x N matrices and let zi be the root of 
the equation det(A ~ zB) - 0 which is smallest in absolute 
value. Prove that the expansion of part (a) converges 
whenever | ^ | < \zi \ and diverges whenever 1^1 > 1 21 1. 

7.6 Show that in general any quantum mechanical operator can be 
expressed as an integral operator. 

7.7 Find an expression for the translation operator Ta, defined 
by Taij/ix) ^ i//{x + a). 

7.8 The solution of the initial value problem of the time- dependent 
Schrbdinger equation can be expressed in terms of an oper- 
ator Uit) that transforms the wave function at time t - 0 into 
the wave function at time t; that is, - IJ(t) i/y(0). 

(a) Upon what properties of the Schrodinger equation does 
this depend? 

(b) Express Uit) as an integral operator in terms of a com- 
plete set of energy eigenfunctions. 

(c) Evaluate the kernel of the integral operator Uit) for the 
motion of a free particle. 

7.9 The concept of the quantum mechanical average value has its 

natural generalization in the notion of the ‘‘inner product'' in 
the representation space (see Appendix), The average value 
M - is the inner product of m with the vector M'P. 

Demonstrate the following: 

(a) Let I... 00 , be a complete set of basis vectors 

which is orthonormal with respect to the inner product, 

- dik. Show that the action of an operator M is 
completely determined by its matrix elements M^k - 
MV^). 

(b) An operator U is Hermitian if (^,//0) - for all 

vectors 'P and Show that the eigenvalues of /] are real, 
the eigenvectors corresponding to distinct eigenvalues are 
orthogonal, and there exists a basis in which H is diagonal. 
Find the conditions on the matrix elements Hik such that 
H is Hermitian, 

(c) If ('P/W) - 0 for all vectors 'P and if (0 is Hermitian, then 
((j), (Q'P) ^ 0 for all 'P and O. One may then assert the oper- 
ator equation (0 - 0. 

(d) If (W, A<P) = (B'P,^)) for all W and 4), then B is said to be the 
Hermitian conjugate of A and is given the special notation 
B - Find the relation between the matrix elements of 
A and A^, Show that A is the Hermitian conjugate of A^ 
and that a Hermitian matrix is self-conjugate, H = 
Express the Hermitian conjugate of the operator product 
AC in terms A^ and 

(e) Show that if the inner product is positive definite, i.e,, 
(4), 4)) > 0 for all 4> but the null vector, then the eigenvalues 
of AA^ are non-negative. 

(f) An operator IJ is unitary if (U'P,L/4)) - ('P,4)) for all W and 4>, 
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If is an orthonormal basis, then 'Pj = U% Is 

also. Find the conditions on the matrix elements Uik such 
that U is unitary. Show that U is non- singular. 

(g) Show that the operators defined in problems 7.7 and 7.8 
are unitary. 

7.10 If A and B are N xN matrices and if AB = /, show that BA = /, 
I being the N xN unit matrix. Construct a counter example 
to show this result is not true in general for “infinite 
matrices,” i.e., N * 


Chapter 8 

8.1 Show that the commutation relations between the operators 
for the velocity components of a charged particle in a mag- 

netic field are VxUy ~ VyVx ^ Bz and cyclic permutations. 

mlc 

8.2 Show that if and are energy eigenfunctions and if M does 
not depend explicitly on time, then the time dependence of off- 

diagonal matrix elements is givenby — 

dt 

where lioiik Ei - E^. 

8.3 Let Uit) be the operator that generates the solution W(^) of the 
time- dependent Schrodinger equation (with Hamiltonian H) 
from arbitrary initial data 'P(O) in the form ^it) ^ U(t) 'P(O). 
See problem 7.8. 

(a) Show that U{t) satisfies the operator equation itUU) = 
HU(t), with initial data i;(0) = 1. 

(b) Deduce from the differential equation that Uit) is unitary. 

(c) Specify under what conditions and in what sense Uit) may 
be written as Uit) = expi-iflt/W. 

8.4 In the Schrodinger picture observables are represented by 
operators that do not depend explicitly on time, the change of 
dynamic variables with time being described by the change of 
state vector ^^(0. In the Heisenberg picture the time depend- 
ence is transferred to the operators themselves by means of 
the unitary operator in problem 8.2. The procedure is Mit) = 
(Wit),MWit)) - ('Pq,ITL(0^q), where 'Pq = 'P(O) is the state vector 
at / = 0 and where ITK/) - Uit)~^MUit). 

(a) Show thati^tri= lUW - M where U = UiU'^HUit). If H is 
independent of time U = This is the Heisenberg equa- 
tion of motion. 

(b) If AB ~ BA = iC, then QB - BQ =. iC. 

(c) Derive an expression for the time derivative of the op- 
erator product Q(0B(0. 

(d) Find the position operator in the Heinsenberg picture for 
the motion of a free particle. Express the result in the 
coordinate representation. 
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(e) Using the position operator in part (d), find the time de- 
pendence of the variance of the position of a freely moving 
particle. 

8.5 Show that the mean value of the kinetic energy in a state be- 
longing to the discrete spectrum is related t o the m ean value 
of the potential energy by the relation 2T = (r • VV). If ^ is a 
homogeneous function of degree a, V(Ar) = K°-V(r), then 2T = aV 
(virial theorem). 

8.6 Verify the following “sum rules.” 

' 2 

(b) X] I ^ik 1^ = (“F-sum rule*") 
i 2mo 

(C)S4U..|“- 

2/no 

(e) ■= diverges 

i 

Here 'R<o,k = E, - E*. Note that &/,* > 0 if k designates the 
ground state. Sum rules are Important because they provide 
a means of testing the physical content of the theory even 
though the mathematical problem cannot be solved com- 
pletely. 

Chapter 9 

9.1 Formulate the set of coupled differential equations governing 
the population of a large number of quantum states. Show that 
the total number of levels occupied does not change with time. 
Identify the conditions under which the occupation numbers 
are positive definite. Derive the relations among the Einstein 
coefficients which specify the approach to thermodynamic 
equilibrium. 

9.2 Compute the current density of the non-statlonary state 
which describes the transition from the 2p to the Is state of 
hydrogen and calculate the radiation field from this source. 
Identify the angular distribution and state of polarization for 
each value of the magnetic quantum number m. See Chapter 13. 


Chapter 10 

10.1 At what quantum level can a wave packet in a harmonic 
oscillator be localized to 1% of the total excursion? Use the 
uncertainty principle. 

10.2 Find the stationary states of a linear harmonic oscillator in 
a uniform electric field. 
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10.3 Suppose that at t = 0, a particle in the potential field V - 

- mco'^x^ is described by the wave function = const. 

2 

exp |/7zox - ~ where = ma>/^. Calculate the 

probability amplitude for the particle being in each of the 
energy eigenstates. Find the wave function for t > 0. Discuss 
the spreading of the wave packet. 

10.4 Show that ^ H„(0 is the generating function for 

n = o n' 

Tchebycheff- Her mite polynomials. Use it to establish the 
appropriate orthonormality relations. 

10.5 Show that / llm(x)e~^^ dx - y/n , 

•/— 00 

10.6 Infer the matrix elements of the position operator of a har- 
monic oscillator from the spectrum, selection rules and the 
F-sum rule. 

10.7 Solve the Heisenberg equations of motion to find the time- 
dependent Heisenberg operators for the position and mo- 
mentum of a harmonic oscillator. 

10.8 Find the time-dependent Green’s function for the harmonic 
oscillator. Normalize by comparing it to the free particle 
Green’s function at a time nd ^ 1. 

10.9 Study the harmonic oscillator using the operator properties 
of the variables 

a = X I \li7i/2mcj) d/dx , 

y/{m(i)/27i) x - V(^/2maj) d/dx . 

Prove the following statements: 

(a) The Hamiltonian operator is H ^ a + ~ where the energy 

is measured in units of 

(b) The commutation relations 

aa^ ~ a ^ 1 
Ha ~ aH ^ -a 
Ha+ - a^H 

hold as operator equations. 

(c) The Hermitian conjugate (or adjoint) of a is a^. H is 
Hermitian (self-adjoint). 

(d) If Ur is an energy eigenfunction with eigenvalue c, then 
aUr is an eigenfunction with energy e - 1 and (a^)'^Ur is 
an eigenfunction with eigenvalue t + n. This gives a series 
of equally spaced energy levels. 
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(e) There is a finite lowest eigenvalue. 

(f) The series must terminate at the lower end by arriving at 
a ‘Aground state” Uq that satisifes the equation aUo - 0 
(= the null vector). Solve this differential equation for Uq 
and normalize. 

(g) The ‘‘excited states” are with energies 

+ 0 ^w.This gives a complete set of orthonormal eigen- 
functions. 

(h) Determine the matrix elements of a, a^, x and p by alge- 
braic means. Show that there are no finite dimensional 
matrices which satisfy the specified multiplication prop- 
erties. 

Chapter 11 

11.1 Prove the orthogonality and normalization condition of Le- 
gendre polynomials using the generating function (1 - 2rx + 

r 2 )-'A ^ r''F„(x). 

n - 0 

11.2 Derive the commutation relations between the components of 
angular momentum and the components of (a) the position 
operator, and (b) the momentum operator. 

11.3 Let Ax, Ay and A^ satisfy the commutation relations L^Ay - 
AyLjx ~ iTiA^, Lj y A X ~ Ax^^y — — iTiA;^^ LjyAx — Ax Ijx ~ and cyclic 
permutations, with respect to the angular momentum oper- 
ators. Such operators are called “vector operators”; ex- 
amples are r and p. Prove the following relations: 

(a) = 0 

(b) Lx(A • L) - (A . L)L^ 0 

(c) A jc ~ Ajf + L/ Z Ay ~ A^tvy “ FyA^) 

(d) /.^(l'-^A^ - AxL^) - (l^A, - A;,L2)l2 
2(l2a* + A*l2) - 4L*(A-L) 

(e) /(/ ^ \)(A)l‘,Z . (L . A):\(L)\Z. 

11.4 Show that in a state i/j with a sharp value of L^, L^^/J - rrujj^ the 
mean values of Lx and Ly are zero. 

11.5 Suppose a system can be resolved into two weakly interacting 
subsystems 1 and 2 so that the total angular momentum L is 
Li + L 2 . If the subsystems are in states characterized by 
definite values of the quantum numbers /i, and I 2 , hx re- 
spectively, what are the possible values of and what is the 
average value of L^? 

11.6 Express the spherical harmonics for 1 - 0, 1 and 2 as poly- 
nomials in X, y and 2 . 

11.7 Find the transformation rule of the spherical functions Yu, 
Yio and Y 1-1 for a rotation of the coordinate system through 
Eulerian angles a, /I and y. 
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11.8 Let =■ Lx ± iLy. Verify the following relations: 

(a) (Lr - Lt 

(b) l2 = l 2 + i(L+L- + L-L+) 

(c) - iSLj 

(d) L+L- - L-Lh- - 2tL^ 

(e) - L^l2 - 0 

(f) L^Lx - LzL^ = 0 

11.9 Study the way in which the multiplication properties in prob- 

lem 11.8 can be represented in terms of matrices, or what is 
the same thing, linear transformations on a finite dimensional 
vector space. Let ^ finite set of degenerate eigen- 

functions of with eigenvalue A. Here m labels the elements 
of the set. Show the following: 

(a) Lz is an eigenfunction of L^. 

(b) There is some linear combination of the 'P’s which is an 
eigenfunction of L^. Denote the eigenvalue by mt and the 
particular linear combination by 'Pxm* 

(c) L^^X/n eigenfunction of with eigenvalue A. 

(d) is an eigenfunction of with eigenvalue (m ± 1)^, 

(e) From the condition that the norm of nonnegative, 

it follows (respectively for +) that A > m(m ± 1)^^, the 
equality holding if the vector is null, 

(f) The condition for the existence of finite multipletes is 

that the series generated by terminates at both ends. 
This Implies that A has the form /(/ + where / may 
have the values 0, 1/2, 1, 3/2, ..., and that m runs over 
range - - / + 1, 1, /, 

(g) Find the matrices of order 2/ + 1 that represent the action 


of the operators and on the vector space spanned by 
^ ^ fixed value of I, For example, 

^ Select a phase convention 


compatlable with (11.88). 


(h) Write out the explicit matrices for / = 1/2 and verify the 


commutation relations by direct matrix multiplication. 


Chapter 12 

12.1 Find the energy levels and eigenfunctions of an Isotropic 
three-dimensional harmonic oscillator. Find energy eigen- 
functions that are also eigenfunctions of and L^. What is 
the degeneracy of each level? What part of the degeneracy 
stems from Isotropy of the potential? 

12.2 Write out the explicit / = I, m = 1, 0, - 1, eigenfunctions of 
lowest energy of a three dimensional spherically symmetric 
harmonic oscillator. Express these eigenfunctions as linear 
combinations of the solution obtained by using spearation of 
variables in Cartesian coordinates. If each triplet of wave 
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functions is orthonormal, show that the matrix of coefficients 
of the transformation is a unitary matrix. 

12.3 For the isotropic potential Vir) - -g8ir - a), determine the 
range of g for which there is an / = 0 discrete state but no 
I = 1 discrete state. 

12.4 In the case of a continuous spectrum, the wave function far 
from the scattering center is characterized by the ‘‘phase 

shift^^ 8iik) defined as follows: rR^^ir) c sin ~ ~ 

for kr » I {kr » 1 for / = 0), where tk = \l2mEk. For a “short 
range“ potential, 8i is Independent of r (see page 222 and 
Chapter 28B), Calculate the / - 0 phase shift for the potential 
in problem 12.3. 

12.5 The combination = i/^p) occurs frequently and is 


The combination occurs frequently and is 

called the “spherical Bessel function. “ The second solution, 
irregular at the origin p = 0, is n/(p) = con- 

sistent with the definition of Neumann functions, /Vp =r (JpCOSnp - 
J-p)/sin7rp. Verify the following properties: 

(a) differential equation, 




(b) explicit form, 


sin p cos p 


cos p sin p 


(c) recurrence relations, 

}u\ + U-i = —J—U' h = 

(d) asymptotic form (p » 1, p » 1), 


/ + 1 . 

-TT-h + h.i 


j^ip) ^ j-sin^p - 
n^ip) ^ _lco8^p - 


(e) behavior at p = 0, 


j (p) ^ — 2J ! — pi 
' ^ (21 + D! ^ 



524 


PROBLEMS 


rii^p) 


,-'-1 

21 r. ' 


(f) integral representation (see page 202), 




^ £. 


P,{x)e‘P'‘dx 


(g) orthonormality, 


f jjikr)j^{k'r)r^dr 

n 


2kk' 


8(k k ) 


12.6 Find the conditions under which the spherical potential well 
V __ -1 Vo I for 0 r ‘ a, and V = 0 for r > a, can support N 
s-wave bound states, but not /V i 1. 

12.7 In the limit of zero kinetic energy, an s-wave phase shift 

behaves as, - -ka, where a is the “scattering length/’ 

Find an approximate relation between scattering length and 
the energy of a loosely bound state, Fjy - 0. 


Chapter 13 

13.1 Find the momentum distribution of an electron in the Is, 2s 
and 2p states of hydrogen. 

13.2 Calculate the lifetime of a hydrogen atom in the 2p state. 

13.3 Give an expression for the transition rates between consecu- 
tive circular orbits in a hydrogen-like atom. 

13.4 Show that (1 - t) exptx^/(l ~ t)\ ^ (Id) ^Lkix)t^^ is the gen- 

k=0 

crating function for Leguerre polynomials L//x) - 

13.5 At what quantum level is a muonic atom the same size as a 
normal hydrogen atom in its ground state? 

13.6 What is the probability of finding an electron with quantum 
numbers n, I inside a nucleus of radius 

13.7 Show that if the energy is regarded as a complex variable, the 
Coulomb “scattering amplitude’’ for a definite value of /, 

- V has a simple pole singularity at the energy 
value corresponding to each level of the discrete spectrum. 

Chapter 14 

14.1 Evaluate the shift in energy levels of a harmonic oscillator 
produced by a perturbing 6-function potential that is centrally 
located. State the limit of validity of the approximation. 

14.2 Calculate the lowest order effect on the spectrum of a linear 
harmonic oscillator due to the relativistic increase in mass 
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of the particle. In what circumstance is the relativistic effect 
pronounced? When is the perturbation approximation valid? 
Cf. problem 2.10. 

14.3 Treat the Stark effect of the n - 3 level of hydrogen. Deter- 
mine the pattern of splitting and residual multiplicity by 
symmetry arguments. How does the secular equation factor 
in the representation in which is diagonal? What linear 
combinations of unperturbed eigenfunctions are energy eigen- 
functions in the presence of the field? 

14.4 Evaluate the static 0) electronic polarizability of a hy- 
drogen atom in its ground state. 

14.5 For the continuous part of a spectrum, the states may be 
labeled by the eigenfunctions of the free particle Hamiltonian 
Hq and an (exact) integral equation for the eigenfunctions 
W|^(r) of the Hamiltonian Ho f V set up along the pattern of 
perturbation theory. The integral form of the Schrodinger 
equation is 

'P^(r) - expjk-r + J'GE(ry)V(r')\{T')d^r', 


where Ge is the energy Green’s function 


Gr (r,F) 


exp ik\r - t'\ 
4n\r - F I 


characteristic of the free particle Hamiltonian. The Green’s 
function can be defined in terms of a complete set of energy 
eigenfunctions 0,j(r) of the (any) Hamiltonian Ho as follows: 


GE(r,rO 


IT r.t 


(a) Formulate a differential equation and boundary conditions 
for the Green’s function. 

(b) Evaluate the explicit form of the free particle Green’s 
function from its definition in terms of a complete set of 
energy eigenstates. 

(c) Find the relation between the energy Green’s function and 
the time Green’s function Gt. (See problem 5.3.) 

(d) The effect of a “short range” potential on the wave func- 

tion at large distances from the scattering center is de- 
scribed by the “scattering amplitude defined by the 

asymptotic form (for r . oo) 


pikr 

\{r) - f fiO) ^ . 

r 

(See Chapter 28, Section B) Derive an exact expression for 
f(0) in terms of the exact solution of the Schrodinger 
equation. 
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(e) Formulate a perturbation approximation for and f(0) on 
the basis of the above results. 

14.6 For a definite value of the angular momentum the integral 
from of the Schrodinger equation for a spherically symmetric 
potential takes the form 


Ufii(r) = sin kr ^ 


2mo 

'W 



gEi(rr)V(nuki(ndr\ 


where 


gEO (ry) 



su\k\r + r‘ 


'i] 


for I = 0. 

(a) Evaluate gEi(ry) for / > 0. 

(b) What is the relation between the wave functions and be- 

tween the Green’s functions of the three-dimensional 
(problem 14,5) and the partial wave formulations. It Is 
necessary to distinguish ingoing and outgoing wave bound- 
ary conditions The addition formula for spherical 

harmonics is helpful, 


P/(co8 8) == 


4n 

W71 





where cos© = cos^ cosd' + sin^ sin^' cos(<^ - 0'). 

(c) An exact expression for the s-wave phase shift is 


^tan^Q ^ - 


2mo n 

^2 Jo 


sin kr V (r) (r) dr . 


Verify this relation and generalize it for I > 0. What nor- 
malization condition is implied for u^? See problem 12.4. 


Chapter 15 

15.1 Identify the conditions under which the KLein-Gordan equation 
reduces to the Schrodinger. How does the initial data prob- 
lem for the second order differential equation reduce to that 
of the first order equation? 

15.2 Show that the function- space inner product 

is independent of time if if> and <f> are solutions of the Klein- 
Gordan equation. Find the form of the inner product in mo- 
mentum space. Is the inner product positive definite? 
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15.3 A general solution of the Klein-Gordan equation is a super- 

position of a positive-frequency and a negative-frequency 
part, with time dependence and respectively. 

Show that under a Lorentz transformation that does not 
reverse the sense of time, a positive-frequency solution is 
transformed into a positive- frequency solution, i.e., that the 
decomposition is Lorentz invariant. 

15.4 Find a complete set of positive- frequency solutions of the 
Klein-Gordan equation that are orthonormal with respect to 
the inner product in problem 15.2. Show that the complete- 
ness relation (for unequal times) is 


L fjx) fjx')* 

a 


1 f^-ikix-x') 

(2n)^ J 2<u 


where &) = (u(k) = c\/fe2 + and where the Minkowski 

space Inner product is k(x -x') = w(< - - k • (x - \). 


Chapter 16 

16.1 What Is the value of the g factor of a particle described by 

the Hamiltonian (l/2mo) • (p - (e/c)A)J^? 

16.2 A particle of spin 1/2 and magnetic moment n moves in a 
precessing magnetic field 

Hx = a sin0cos<ut, Hy = sin0sinci)t, Hg = U cosO. 

At time t - 0, the spin is parallel to the z axis. What is the 
probability of the spin being antiparallel to the z axis at some 
later time? 

16.3 Find the (time-dependent) position operators in the Heisen- 
berg representation of a particle of spin 1/2 and magnetic 
moment n moving in a nonhomogeneous magnetic field 

Hx = 0, Hy = -ky , Ha = Ho + ftz. 

16.4 Determine the energy spectrum and wave fimctions of a charged 
particle moving in uniform electric and magnetic fields that 
are perpendicular to one another. 


Chapter 17 

17.1 Show that the four matrices (/',o') are Hermltian, linearly 
Independent and form a complete basis for 2x2 matrices. 

17.2 How many matrices are required to form a complete set of 
Hermltian, linearly independent NxN matrices? How many 
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mutually commuting, Hermitian, linearly independent NxN 
matrices are there? 

17.3 In the Dirac theory, a and p are mutually commuting linear 
operators. Develop a notation for the linear vector space on 
which the operators act. Find an expression for the inner 
product in the composite (direct product) space. Show that 
the Dirac Hamiltonian is Hermitian. 

17.4 Use the Heisenberg equation of motion to show 

(a) -^(x f ih p.^a/2mc) -= - (e/c)^/m , 

(b) ~ I ih p^/2mc^) -- - e^)/mc^ . 

17.5 What transformation of the spinor wave function must be made 
to restore the form of the Dirac equation after a change of 
gauge in the electromagnetic potentials? 

17.6 Show that the form of the Dirac equation remains unchanged 
by Lorentz transformation or spatial rotation. 


Chapter 18 

18.1 Suppose a system consists of two weakly interacting sub- 
systems each of spin 1/2. The total spin is S = si + S2. What 
are the possible eigenvalues of and S^? Compute the value 
of • S 2 in the triplet (spins parallel) and singlet (spins 
antiparallel) states of the composite system. Find eigenfunc- 
tions of and as linear combinations of products of eigen- 
functions of the subsystems. 

18.2 An electron moving in a central field of force is in a state 
specified by the quantum numbers Ijmj, What are the possible 
values of the ^ components of orbital and spin angular mo- 
mentum and what is the average value of each? 

18.3 Is the parity operator / a linear operator? Is it Hermitian? 
What are the commutation properties of / with the operators 
r, p, L,, S and J? 

18.4 An axial vector transforms as a vector under proper rota- 
tions (i.e., rotations without space inversion) but does not 
change sign under inversion. Classify the following as being 
either vector or axial vector: K, H, A, S, L, p and u x v 
where u and v are vectors. 

18.5 Show that if a system is in a state characterized by a sharp 
value rrij for the e component of the total angular momentum, 
the mean value of the total angular momentum about an axis 

making an angle 0 with the e axis is nij cos 6^. 

18.6 Evaluate the particle flux of positive energy and negative 
energy plane wave solutions of the Dirac equation. Also cal- 
culate the flux of the corresponding charge conjugate solutions. 

18.7 Can the charge conjugation transformation be represented by 
a linear operator? 
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Chapter 19 

19.1 Evaluate p and j for a Dirac particle in an electromagnetic 
field. Verify that the Lorentz force is given correctly. 

19.2 Calculate the spin-orbit and contact potentials for an elec- 
tron outside a closed atomic core consisting of a hydrogen- 
like atom. 

Chapter 20 

20.1 Calculate the energy level splitting of a hydrogen atom in a 
weak electric field. Neglect the Lamb shift and assume the 
Stark effect is small in comparison with the fine structure. 
Account for the latter by using eigenfunctions of Jz andL^ 
for the unperturbed states. 

20.2 Evaluate the Stark effect for the n - 2, j = ^ level of hydrogen 

for the case where the Stark effect is comparable with the 
Lamb shift. Represent the Lamb shift by a phenomenological 
perturbation matrix element that touches only the s- state. 

20.3 Study the hydrogen Stark effect for the level n -- 2 in the 
transition region where the Stark effect and the fine structure 
are of the same order of magnitude. Plot the energy levels 
as a function of P, 

20.4 Compute the lifetimes of the 2pt, and 2p3/ states of hydrogen, 

/2 12 

20.5 Find an expression for the lifetime of the metastable 2s~ 

state of hydrogen in a very weak electric field. At what field 
strength will the lifetime by 10"'^ sec? Is the Stark level shift 
appreciable at this field strength? Hint: Evaluate the matrix 

element of r using a perturbed 2si wave function that con- 
tains an admixture of the 2p^ state and take advantage of the 
smallness of the Lamb shift. 

20.6 Find the mean value of the operator p - I. + in a state 
characterized by the quantum numbers J, L, S. (This 
gives a generalization of the Lande formula in the case where 
the g factor of the electron is not exactly equal to two. See 
page 346.) Hint: Use problem 11.3(e). 

Chapter 21 

21.1 Determine the form of the contact interaction when the finite 
nuclear size is taken into account. Calculate the splitting of 

the 2s-^---2pl levels of hydrogen stemming from this effect. 

It is about - 0.1 Me. 
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21.2 Calculate the hyperfine splitting of the ls| state of hydrogen 

using a classical model in which the nucleus is represented 
by a uniformly magnetized sphere of radius R/y. Give a 
classical explanation of the sign of the splitting. 

21.3 Find the magnetic field at the nucleus of an s state electron. 

21.4 Consider a hydrogen atom in the ground state in a uniform 
magnetic field. Find the appropriate linear combinations of 
electron- proton spin wave functions that are energy eigen- 
functions in the case where the interaction with the external 
field is the same order of magnitude as the electron- proton 
dipole- dipole interaction. 

21.5 Calculate the hyperfine splitting of a hydrogen-like atom in a 
state of non- zero orbital angular momentum. Hint: Use 
problem 11.3(e). 

Chapter 22 

22.1 Specify a complete set of commuting constants of the motion 
for posltronium. 

22.2 Calculate the fine structure of posltronium. Obtain the 
Hamiltonian by semlclasslcal arguments. Account especially 
for ‘‘hyperfine’’ splitting, knowing that the magnetic moment 
of the positron is equal in magnitude and opposite in sign to 
that of the electron. Present the results in an energy level 
diagram for n == 1,2, 

22.3 Evaluate the Lamb shift for posltronium. 

22.4 Construct a theory for posltronium using a Dirac Hamiltonian 
for each particle and the Coulomb interaction between them. 
Separate the motion of the center of mass from the relative 
motion in the approximation of retaining only the lowest order 
relativistic corrections for the relative motion. Does the 
“hyperfine” interaction emerge automatically? Hint: The 
action of spin operators on a product wave function can be 
expressed as follows: 

± X (f> X xfj^ ± ^0 X 

Chapter 23 

23.1 Construct a complete set of orthonormal two-particle eigen- 
functions from a complete set of one-particle eigenfunctions. 

23.2 Derive the Hartree-Fock equations for determining the best 
single-particle functions to give an antisymmetric (or sym- 
metric) two-particle wave function of lowest energy. 

23.3 Calculate the n - 1 and 2 levels of a hydrogen-like atom, in- 
cluding fine structure and the Lamb shift, using a variational 
method. 
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23.4 Show how the variational method with trial functions yJr = 

can be reduced to a matrix procedure if the 4's are 
regarded as the variational parameters. The a*s may be 
selected by Intuition or by an itteration scheme. Note how 
this gives a finite set of orthonormal vectors. 

23.5 Evaluate the ground state energy of helium using the follow- 
ing “ self-consistent’ ' variational method: assume the first 
electron is in a hydrogenic Is state with Z = Z' and calculate 
the screened field seen by the second electron. Describe the 
second electron by a hydrogenic Is function with Z = Z\ 
Vary Z'^ to find the lowest energy for a given z'. Then impose 
the symmetry by setting Z' - Z"^. 

Chapter 24 

24.1 Give the possible values of the total angular momentum for 

the following states (terms): ^S. ^P. ^P. and'^D. 

24.2 Which terms are possible for the following two-electron con- 
figurations: (a) nsn's, (b) nsn'p, (c) nsn'd, and (d) np, n'p? 
Which terms are consistent with the exclusion principle if 
n^n'? 

24.3 Couple three unit angular momenta /i ^ /2 = /a 1 to yield a 

resultant eigenstate L = 1, = 0 of where L = li + I 2 + I 3 . 

How many Independent states of this sort are there? 

24.4 Estimate the low-lying excited state energies of helium in 
the approximation of neglecting exchange effects. Do this 
assuming that one electron is described by the Is function 
found in the ground state calculation and carrying out the 
variational procedure for a hydrogenic 2s (and independently, 
2p) wave function in the screened Coulomb field. The 2s state 
must be taken orthogonal to some appropriate Is state. The 
root of the variational equation may be found by a rapidly 
converging Iteration method. 

24.5 Calculate the low-lying excited state energies of helium taking 
into account exchange and spin effects. Use appropriately 
symmetrized product eigenfimctions for the configurations 
(Is, 2s) and (Is, 2p), where the Is function is that found in 
the ground state calculation. The required matrix elements 
are given in problem 24.6 for reference, 

24.6 Consider the hydrogenic wave functions 

- 2a^^^e-'^Yoo 
'P2s - b(l - Br)e-^^Voo 


'P 2 p = cre^'^Yim 

where a = Zu/ao, /3 = Z2s/2oo, and y = Z2p/2ao and where b, B 
and c are determined by the orthogonality and normalization 
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conditions. Verify the following one- and two-particle matrix 
elements: 


T2p ^ ('P2p. TVap) - 


tjl 

2mo 


V-i. 




K ( l.s2p) 


Cq y^ia^ + 5ay + lOy^) 
6(a + y)"^ 


A(ls2p) -- 


286^2 a2y2 

3 (a t- yV‘^ 


T 2 . 



mo \ 6 

- aft + p‘^/ 

V2s 


P^ia - 2/3) ' 
~ ap^ 


a2_ _ 

a^p(2a 

(a + 

2(a 1 - pV'Ha^ 


K(ls2s) 

afi + jS^) , 

4ej{a^B'' 

^(l.s2s) .- ' 

(a + P)^(a^ - afi + 0^) 

24.7 Calculate the ground state of lithium by means of a variational 
method based on an antisymmetrized product wave function 
for the configuration (Ls^2.s). The appropriate matrix elements 
can be reduced to those given in problem 24.6. 


Answer: The best variational wave function is given by 
Zis = 2,694 and Z2s = 1.534. The energy is 
- 200.8 eV compared with the perturbation theory 
value - 190.8 eV and the observed value - 202.54 
eV, 

24.8 Calculate the (ls^2p) excited state of lithium by the variational 
method, (a) neglecting exchange effects and assuming a 
helium core, (b) neglecting exchange effects and using the 
Ls^ core found in problem 24.7 and (c) taking exchange effects 
into account using the antisymmetrized wave function for the 
configuration (Ls^2p), 


Chapter 25 

25.1 Show that the virial theorem 2T 4 + ^71-0 - 0 is fulfilled 

in the Thomas- Fermi model. Does it also hold in the varia- 
tional solution? 
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25.2 Using the virial theorem, show that in the Thomas-Fermi 
model of a neutral atom the energy of electrostatic repul- 
sion between the electrons is 1/7 the magnitude of the elec- 
trostatic attraction between the electrons and the nucleons. 

25.3 Estimate the order of magnitude of the following quantities 
in a neutral atom according to the Thomas-Fermi model: 

(a) the size of the atom, 

(b) the average electrostatic repulsion between two electrons, 

(c) the average kinetic energy of one electron, 

(d) the average speed of an electron, 

(e) the average angular momentum of an electron, 

(f) the mean radial quantum number. 

25.4 Show that the mean perturbation of all states of a given term 
is zero for the spin- orbit interaction. 

25.5 Calculate the L • S splitting of the term of helium. 

25.6 Evaluate the L • S splitting of a single np electron in a spheri- 
cally symmetric potential. Express the result in terms of an 
arbitrary radial matrix element. Estimate the radial matrix 
element for the doublet splitting of sodium. 

25.7 Two electrons move in an (np)'^ configuration in a spherically 
symmetrical potential. Regarding the electrostatic repulsion 
between the electrons as a perturbation, evaluate the splitting 
of terms in first order approximation. Neglect the spin- 
orbit interaction. Use qualitation considerations to infer 
ordering of the terms. Hint: To facilitate diagonallzation of 
the secular equation, use a representation in which and Ms 
are diagonal and note that the sum of the roots of a secular 
equation is equal to the sum of the diagonal matrix elements. 


Chapter 26 


26.1 Find the possible atomic terms (a term is characterized by 
L and S) in a configuration of two equivalent d-electrons. 
Give the total number of states and the number of states in 
each term. What values of J are possible to each term? 

26.2 In an atomic configuration the term that has the lowest 
energy can be determined by Hund's semi-empirical rules: 

(1) The ground state will have the largest value of S con- 
sistent with the Pauli principle, 

(2) L will have the largest value consistent with the value of 
S determined in rule (1), 

(3) The total angular momentum of the ground state is J = 
IL - SI if the unfilled subshell is half full or less than 
half full; it is J - L 4 S if the subshell is more than half 
full. 

Give a qualitative physical justification for each of these rules. 
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26.3 Using Hund’s rules (see problem 26.2), find the ground state 
of the configuration np^ for x = 1, 2, . . . , 6. For each value 
of X, state an element for which this case is realized physi- 
cally. 

26.4 Do problem 26,3 for the configuration nd^ for x = 1, 2, . , , , 10. 


Chapter 27 

27.1 Estimate the relative frequencies and separation of energy 
levels for the electronic, vibrational and rotational motions 
of a diatomic molecule, 

27.2 Derive the Schrddlnger equation describing the motion of the 
nuclei of a diatomic molecule in the approximation that the 
nuclei move much more slowly than the electrons and thus 
experience only an Interaction with the electrons that is 
averaged over many electron revolutions. This procedure 
provides a separation of variables between the electronic and 
nuclear motions. The approximation is called the adiabatic 
or Bom-Oppenhelmer approximation. 

27.3 What are the possible symmetry states of the diatomic mole- 
cules D 2 , N 2 , LiH formed from the bonding of the two atoms 
in their ground states? 

27.4 What spin symmetries are possible for the rotational states 
of the deuterium molecule D 2 in the electronic ground state? 
The deuterium nucleus has spin 1. 

27.5 Calculate the energy^of a rigid electric dipole in a uniform 
electric field. Use second order perturbation theory. 

27.6 Show that the force between two hydrogen atoms in their 
ground states varies as l/R*^ if the atoms are separated by a 
large distance R. 

Chapter 28 

28.1 The rate for making transitions from an Initial state i to the 
final state f is 


where Pf is the number of final states per unit energy interval 
(‘‘Golden Rule No. 2^’). 

(a) Find the expression for U^f in first order perturbation 
theory. What normalization convention is implied for the 
wave function? 

(b) Derive an expression for U'lf in second order perturbation 
theory. 
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(c) Evaluate the density of states for a final state consisting 

of two free particles of definite total energy and momentum. 

(d) Evaluate Pf for a final state consisting of three equal mass 
(free) particles. Express the results in the center of mass 
system. How is the partition of energy among the three 
particles accounted for by a probability distribution? 

(e) Calculate the dependence of Pf on energy near the energy 
threshold for an /V-particle final state. 

28.2 Calculate the rate for induced transitions from state m to 
state n of an atomic system in an electric field with spectral 

density 8 (&>) at transition frequency "u = - E„). Identify 

n 

the Einstein coefficient B and thereby infer the rate for 
spontaneous emission. 

28.3 Consider a particle of mass m bound in a three-dimensional 

harmonic oscillator potential The particle is irradiated 

with a mild pulse U' = for the time interval < 

t < oo. Determine which transitions are possible and calculate 
their probabilities. Identify the limiting cases of sudden and 
adiabatic perturbations. State the limits of validity of the 
perturbation calculation. 

28.4 At t = 0, a hydrogen atom in its ground state is irradiated 
with a uniform periodic electric field. Determine the mini- 
mum frequency of the field necessary to ionize the atom and 
compute the ionization probability per unit time. As an ap- 
proximation, the electron in the final state may be regarded 
as free. State the limits of validity of the approximation. 

28.5 Show that the scattering of slow particles in a short range 

potential is characterized by 5^ (fe) ~ Find the proportion- 

ality constant in Born approximation and state the conditions 
imder which the approximation is valid. 

28.7 Taking into account the symmetry of the wave function, give 
the differential cross section for elastic Coulomb scattering 
of an electron on an electron and of an alpha particle on an 
alpha particle. Distinguish spin states and also give the 
formula for the scattering of unpolarized electrons. Identify 
quantum effects and show how they disappear in the classical 
limit. For reference, the exact scattering amplitude for a 
fixed Coulomb potential is 

f iO) = - iy) ^ ^-2tyln Rin $/2 

2sin2| r(l ~ ly) 

where y = e^/tv^ Cf. Chapter 13, Section D. 

28.8 Compute in Born approximation the differential scattering 
cross section for the scattering of fast neutrons by a Coulomb 
field. 
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28,9 Set up coupled Schrodinger equations to describe the “two 
channer' reaction and scattering processes 

(11) a + b * a + b, 

(12) a + b * c + d, 

(22) r ] d ^ c ^ d, 

(21) c + d a + b. 

Assume the particles are spinless and of unequal mass. 
Separate out the motion of the center of mass. 

(a) How must the potentials 1 ^ 21,^22 be related in 

order to describe a system that is invarient with respect 
to reversal of the sense of time? How are the cross sec- 
tions for reactions (12) and (21) related? (detailed balance) 

(b) Give expressions for the effective cross sections for these 
reactions in Born approximation. 

(c) Formulate an expression for the reaction and scattering 
amplitudes in terms of partial wave amplitudes of definite 
angular momentum. What relation among fuJi 2 ^ 21*^22 (for 
a definite value of /) is implied by conservation of prob- 
ability? 

Note: The results are presented most simply in terms of a 
matrix notation. The many-channel generalization of the 
quantity is called the 8-matrix. 


Chapter 29 

29.1 Verify the anticommutation relations 

satisfied by the second quantized Dirac wave function. The a 
and p indicate components of the Dirac spinor and the 
unit 4x4 matrix. The positive-frequency parts of <// and i//^ 
describe the creation of positrons and electrons, respectively, 
while the negative- frequency parts describe the annihilation of 
electrons and positrons, respectively. Electrons and positrons 
are described by independent sets of mutually anticommuting 
creation and annihilation operators 

C^e)C^p) ^ 0 , 
n n n n ’ 

c(p)c^v)* ^ = 0 . 

n n n n 

29.2 Evaluate the commutation relations between components of 

the current and charge density of a quantized Dirac wave 
field, i.e., evaluate at equal times. 
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29.3 Calculate the Born approximation matrix element for the 
production of an electron- positron pair in an external electric 
field 

0(r,^) = COS kx COS a)t , 

29.4 Express the energy and momentum operators of a quantized 
Dirac wave field in terms creation and distruction operators. 

29.5 Let M(x) be a massive pseudoscalar meson field satisfying 
the Klein-Gordan equation. Calculate the force between two 
stationary Dirac ^‘nucleons’’ in second order perturbation 
theory, assuming the interaction energy between the ‘'nu- 
cleon'* and meson fields is 

H' = f tfj^ix) p^p^tl/ix)M(x)d^ X . 











